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Preface 


The  papers  in  this  volume  represent  the  proceedings  of  the  Advanced  Seminar  on 
Applications  of  Bifurcation  Theory  held  in  Madison  on  October  27-29,  1976.  With 
the  exception  of  the  survey  by  M.  G Crandall,  the  papers  are  published  in  the  order 
in  which  they  are  presented. 

The  purpose  of  the  Advanced  Seminar  was  to  survey  a variety  of  applications  of 
bifurcation  theory  to  the  sciences.  There  is  a wealth  of  such  applications  and  the 
papers  contained  in  this  volume  discuss  bifurcation  problems  in  such  areas  as 
elasticity  theory,  fluid  dynamics,  geophysics,  astrophysics,  meteorology,  statistical 
mechanics,  and  chemical  kinetics.  There  are  also  two  papers  of  a more  mathemati- 
cal nature:  The  volume  begins  with  a discussion  of  analytical  methods  for  bifurca- 
tion problems  and  concludes  with  a paper  on  numerical  approaches  to  bifurcation 
questions. 

We  thank  the  authors  for  their  efforts  to  meet  our  manuscript  deadlines  and  thank 
Doris  Whitmore,  who  put  the  volume  together  and  compiled  the  index.  We  are 
especially  grateful  to  Gladys  Moran,  who  so  ably  attended  to  the  details  and  paper 
work  associated  with  the  Conference.  Finally,  we  are  indebted  to  the  Army  Re- 
search Office  under  Contract  Number  DAAG29-75-C-0024,  the  National  Science 
Foundation  under  Grant  Number  MCS76- 12286,  and  the  Office  of  Naval  Research 
under  Contract  Number  NOOO14-76-M-0036  for  their  financial  support,  which 
made  possible  the  Advanced  Seminar  and  this  volume. 

Paul  H.  Rabinowitz 
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An  Introduction  to  Constructive  Aspects 
of  Bifurcation  and  the  Implicit  Function 
Theorem 

M.  G.  Crandall 


The  goal  of  the  lecture  to  which  this  article  corresponds  was  to  re- 
view some  basic  constructive  results  in  bifurcation  theory  from  the  point 
of  view  used  by  the  author  and  P.  Rabinowitz  in  [3-5].  Since  the  exist- 
ing literature  seems  quite  adequate  in  this  regard  (see,  e.g.,  [3-5],  [16] 
and  the  review  article  [6])  there  is  little  point  in  merely  pursuing  the 
same  goal  here.  Thus  a major  portion  of  the  current  paper  is  devoted  to 
presenting  introductory  material  which  may  prove  useful  to  certain  pro- 
spective readers  of  the  above  literature.  The  contents  of  this  paper  are 
outlined  below. 

Section  1 attempts  to  describe  in  a general  way  what  bifurcation 
theory  is  concerned  about  with  emphasis  on  the  parts  of  greatest  Interest 
to  us.  The  style  is  informal  and  the  attempt  is  addressed  to  both 
mathematicians  and  nonmatnematlcians. 

Section  3 introduces  the  simple  methods  and  some  of  the  results  of 
[ 3 ],  [4]  and  [5],  For  those  who  already  know  the  terms,  the  main  results 
concern  bifurcation  from  "simple  eigenvalues",  Hopf  bifurcation  and 
stability.  The  exposition  is  brief  and  things  are  presented  somewhat 
differently  than  elsewhere.  The  goal  is  to  offer  a sampling  adequate  for 
the  reader  to  Judge  whether  or  not  he  wishes  to  pursue  these  topics 
further. 

Section  2 is  an  attempt  to  explain  and  to  sell  the  principal  analyti- 
cal tool  used  in  Section  3 to  potential  users  of  this  tool  - the  implicit 
function  theorem  in  Banach  spaces.  Many  people,  appreciating  the 
amazing  scope  and  simplicity  of  this  theorem,  have  tried  to  propogate  its 
use  with  missionary  zeal.  (As  Paul  Rabinowitz  commented  after  my  lec- 
ture - "Nice  sermon  - er,  lecture". ) It  is  strange  then  that  this  cause 
is  not  further  advanced  a half-century  after  these  tools  became  easily 
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available.  Section  2 is  completely  independent  of  Section  1 and  3 and  is 
our  first  written  contribution  (for  better  or  worse)  to  the  cause. 

Lack  of  space  and  our  limited  goals  precluded  the  discussion  of 
serious  concrete  examples  in  this  paper.  We  have  restricted  ourselves 
principally  to  general  settings  and  a few  illustrative  examples.  There  is 
no  lack  of  good  examples.  We  suggest  this  volume,  [3],  [7],  [12],  [13], 
[14]  and  [22]  as  sources. 

Other  expository  sources  covering  various  aspects  of  bifurcation 
problems  (many  not  touched  upon  here)  are  [16],  [17],  [18],  [19],  [20]  and 
[24].  Another  point  of  view  is  taken  in  [2]. 

The  current  paper  is  not  an  appropriate  place  to  be  very  concerned 
about  references.  The  sources  listed  above  and  the  rest  of  this  volume 
provide  access  to  most  of  the  literature. 

Section  1.  Introduction 

The  mathematical  models  of  science  and  engineering  may  often  be 
written  in  the  form 

(1. 1)  jjjr  + G(w)  = 0 

where  w is  the  state  of  the  system  under  consideration.  We  are  think- 
ing of  (1. 1)  in  a very  general  way.  If  the  "system"  under  consideration 
is  a bacteria  culture,  the  "state"  might  be  the  number  or  mass  of  the 
bacteria.  The  simple  model  of  this  situation  presented  in  beginning  cal- 
culus reads  dw/dt  = kw  for  some  constant  k,  so  here  G:  ]R  -*•  1R  and 
G(w)  = -kw  . In  this  example  the  state  is  a number.  If  one  considers 
a system  of  particles,  the  state  will  be  a list  of  numbers  determining  the 
positions  and  velocities  of  each  particle.  Many  of  the  problems  we  are 
interested  in  have  "states"  which  are  not  lists  of  numbers  but  functions. 
For  example,  in  some  models  of  a fluid  confined  in  a region  [!C  IRJ  the 
state  of  the  fluid  at  time  t is  taken  to  be  a function  w(t,  x)  = (w^t,  x)  , 
w2(t,  x),  w^(t,  x)),  x = (Xj,  x2,  x^),  which  tells  the  velocity  of  the  fluid 
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at  x « ft  time  t . Then  (1.1)  represents  a complicated  system  of  eoua- 
tions  involving  partial  differentiations  and  other  operations  well  as 
boundary  conditions.  We  will  not  explicitly  discuss  such  complicated 
[ models  here  - they  appear  elsewhere  in  this  volume.  But  let  us  indicate 

1 how  to  rewrite  a Droblem  in  partial  differential  eauations  in  the  form 

(1. 1). 


(1.2) 


For  this,  consider  the  example  problem 


r 

9w  9^w 

(DE) 

~dt  ' ~2  + f<X’W> 

9x 

9w  „ 9w  . 

/ ( BC ) 

-^(°,t)  = — (1,t). 

(IC) 

V- 

w(x,  0)  = wQ(x)  , 

= 0,  t > 0,  0 < x < 1 

0 , t > 0 

0 < x < 1 


consisting  of  the  differential  equation  (DE),  the  boundary  condition 

(BC)  and  the  initial  condition  (IC)  . Here  the  unknown  function  w(x,  t) 
2 

is  sought  and  r : 1R  -*  1R  is  a real-valued  function  of  two  real  variables. 
The  parameter  \ is  introduced  for  later  use.  Right  now  think  of  (1.  2)  as 
a problem  for  w with  X.  fixed.  If  we  change  X we  get  a different 
problem.  (1.2)  corresponds  to  a system  whose  state  at  time  t is  w(x,  t) 
as  a function  of  x.  Let  us  think  of  this  w as  the  mapping  t — w(*,t) 
where  w(*,t)  is  the  function  x — w(x,  t)  . We  then  write  w(t)  rather 
than  w(x,  t)  . Now  w(t)  is  itself  a function  for  fixed  t and  w(t)  cor- 
responds to  w(x,  t)  via  w(t)(x)  = w(x,  t)  . Next  (for  each  X)  define  a 
function  G(X,  •)  on  functions  of  x by 

.2 

(1.  3)  G(X,  w)  = - — — w + r(X,  w)  . 

dx 


To  each  suitable  function  w = w(x)  (1.  3)  associates  a new  function 
G( X,  w)  . Moreover,  we  incorporate  the  conditions  (BC)  into  the  defini- 
tion of  G . This  is  done  by  restricting  the  domain  of  G . Let  us  agree 
that 
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(1.  4)  domain  of  G = D(G)  = {w  « C ([0, 1]):  w'(0)  = w'(l)  = 0}  . 

2 

Here  C ([0,1])  means  the  set  of  twice  continuously  differentiable  functions 

on  [0,1],  Corresponding  to  regarding  w(x,  t)  as  w(t),  we  rewrite 
0 d 

— w(x,  t)  as  w(t)  . (To  actually  be  precise  about  this  we  have  to 
say  what  the  allowable  values  w(t)  are,  that  is  what  states  we  are  go- 
ing to  consider.  Here  we  arbitrarily  agree  that  states  should  be  contin- 
uous functions  of  x,  that  is  w(t)  « C([0, 1])  . Then  ^ (tQ)  = 
f « C([0, 1])  simply  means  that  the  function  h'*(w(tQ +h)(x)  - w(t  )(x)) 
converges  to  f(x)  uniformly  for  x « [0, 1]  as  h — 0 . The  whole  prob- 
lem (1.  2)  can  formally  be  rewritten  as 


(1.  5) 


(DE)  ^ + G(\,  w)  = 0,  t > 0 
|jIC)  w(0)  = wQ  . 


The  (DE)  of  (1.5)  includes  the  (DE)  of  (1.2)  and  (BC)  which  is  in- 
corporated via  (1.4).  We  agreed  not  to  plug  w into  G(X,  •)  unless  it's 
derivative  with  respect  to  x vanishes  at  0 and  1 . 

Now,  you  may  well  (and  should)  complain  that  you  don't  know 
what  (1.  5)  "means".  To  say  precisely  what  (1.  5)  means  is  to  define  pre- 
cisely what  is  meant  by  a solution  of  (1.  5)  (and  the  same  is  true  for  (1.  2) 
to  begin  with).  This  is  a more  serious  problem  if  one  works  with  other 
state  spaces  in  the  current  example  (e.  g. , L*(0, 1))  or  more  complex 

equations  (e.  g. , problems  in  fluid  dynamics).  We  sidestep  this  question 
here  by  staying  mostly  in  an  abstract  setting.  This  volume  and  the  ref- 
erences herein  provide  access  to  an  enormous  spectrum  of  significant 
concrete  models  of  all  levels  of  complexity.  We  do  caution  the  reader 
that  only  special  choices  of  state  spaces  are  either  physically  or  math- 
ematically reasonable  in  a given  problem.  It  can  also  easily  happen 
that  a mathematically  convenient  choice  may  exclude  physically  signifi- 
cant possibilities. 
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It's  time  to  discuss  the  appearance  of  parameters.  Each  mathemat- 
ical model  contains  (implicitly  or  explicitly)  parameters  corresponding  to 
the  real  world  situation  being  described.  For  example,  the  outcome  of  a 
bacteria  growing  experiment  will  certainly  depend  on  the  size  of  the  ex- 
perimental apparatus,  the  temperature,  the  composition  of  the  ambient 
atmosphere  and  other  parameters,  some  of  which  may  depend  on  time. 

To  accomodate  all  of  these  we  would  have  to  have  a model  like  dw/dt  + 
G(\j,  X.,,  . . . , X^,  t,  w)  = 0 with  a large  number  k of  parameters.  How- 
ever, even  the  biologists  would  be  unable  to  figure  out  how  G should 
depend  on  all  these  quantities.  Thus  they  might  try  to  fix  as  many  as 
possible  and  perhaps  vary  one  so  as  to  see  the  effect  of  this.  Say 
several  experiments  are  run  with  all  parameters  fixed  in  each  experiment 
and  only  the  temperature  varied  between  experiments.  Then  one  is  con- 
sidering dw/dt  + G(T,  w)  = 0 . This  is  typical  - there  are  in  fact  always 
many  parameters.  To  get  a tractable  problem,  it  is  assumed  that  many 
are  fixed  and  a very  few  are  varied  between  experiments.  Whether  this 
is  reasonable  in  a given  case  is  not  a mathematical  problem  - it  depends 
on  the  information  being  sought  about  the  system  and  upon  the  agreement 
between  theory  and  experiment.  Many  times  several  parameters  in  a 
model  can  be  lumped  into  a single  one  by  scaling  procedures  - these 
then  often  get  names  like  Reynold's  number,  etc. 

Thus  we  consider  an  "evolution  problem" 

(1.6)  ^ + G(\,w)  = 0 

which  depends  on  parameters  X . We  allow  X to  be  a vector  and  hence 
represent  many  parameters.  The  simplest  solutions  (1.6)  can  have  are 
equilibrium  solutions.  These  are  time-independent  solutions  of  (1.6), 
i.  e.  the  states  which  satisfy  G(X,  w)  = 0 . "Bifurcation  theory"  con- 
cerns itself  with  the  study  of  the  problem  G = 0,  the  stability  of  equil- 
ibrium solutions  of  (1.  7)  and  the  study  of  (1.  7)  itself  especially  in  neigh- 
borhoods of  points  where  the  structure  of  the  set  of  solutions  under  con- 
sideration somehow  changes.  This  is  necessarily  vague  - "bifurcation 
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theory"  is  not  a very  unified  subject  of  study.  The  results  of  Section  J 
are  motivated  by  examples  in  which  the  "change  of  structure"  happens  as 
described  below. 

Assume  there  is  a known  curve  w = #>{X)  of  solutions  of  G(X,  w)  = 0. 
This  means  that  <p(\)  is  an  admissible  state  for  each  V and  G(  X,  <p(  \ » = 0 . 
Next  change  variables  in  (1.  6)  according  to  u + <^( \)  = w : 

du  dw 

= JjjT  = -G(\,  w)  = -G(X,  u + </>(M) 


or 

(1.7)  ^rF(k,u)  = 0 

where  F(X,  u)  = G(X,  u + <p(X))  satisfies 
(1-  8)  F( X,  0)  = 0 . 

For  example,  if  G is  given  by  (1.  3),  (1.4)  and  r(X,  w)  = -X(w-X)  + 

2 

(w-X.)  w,  then  ip(\)  could  be  the  constant  function  X (constant  states 

2 

are  admissible  in  (1.4)).  If  u = w - X,  then  F(X,  u)  =-—  - Xu  + 

2 j 2 

u (u+X)  on  the  domain  of  F . ax 

We  call  the  solutions  u = 0 of  (1.  7)  the  trivial  solutions.  It  is 
always  interesting  to  study  the  stability  of  a known  solution  of  an  evolu- 
tion problem.  Roughly,  0 is  a stable  solution  of  (1.  7)  (X  is  now  re- 
garded as  fixed)  provided  that  we  can  keep  the  solution  of  (1.  7)  as  close 
to  0 as  we  desire  for  all  t > 0 by  choosing  w(0)  sufficiently  small 
(in  some  sense)  and  0 is  an  asymptotically  stable  solution  of  (1.7)  if  it 
is  stable  and  we  can  also  guarantee  that  u(t)  -*  0 as  t — °°  by  choosing 
u(0)  sufficiently  small.  If  we  are  in  finite  dimensions  (i.  e.  u c ]RN  and 
F:  H X]RN-]RN)  for  some  k and  N,  it  is  well-known  that  the  zero 
solution  of  (1.  7)  is  asymptotically  stable  if  the  eigenvalues  of  F (X,  0) 
(the  Jacobian  matrix  (8Ft/8u^)  evaluated  at  X,  u = 0)  all  have  positive 
real  parts  and  is  unstable  If  F^x,  0)  has  an  eigenvalue  with  a negative 
real  part.  If  we  are  in  "infinite  dimensions",  as  is  the  case  when  (1.  7) 
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corresponds  to  a problem  in  partial  differential  equations,  analogues  of 
this  remain  true.  One  has  a "linearized  operator"  F^X,  0)  (see  the  next 
section)  at  the  trivial  solutions  and  its  eigenvalues  (or,  more  precisely, 
t spectrum)  often  corresponds  in  the  above  way  to  the  stability  of  the 

J trivial  solution.  The  assumption  that  this  is  so  is  called  the  "principle 

of  linearized  stability". 

Assume  now  that  X is  one  real  parameter  and  there  is  a critical 
parameter  value  X^  such  that  u = 0 is  stable  for  X < X^  and  is  un- 
stable for  X > X . This  information  may  be  pictured  as  being  obtained 
via  the  principle  of  linearized  stability  and  knowledge  of  the  spectrum  of 
the  linearized  operator  or  from  experiments  in  which  u = 0 is  observed 
for  X < XQ  but  is  not  observable  for  X > XQ  . The  problem  discussed  in 
Section  3 is  to  figure  out  a little  about  what  things  look  like  near  X = XQ 

and  u = 0 . Since  F (X.  0)  has  spectrum  with  positive  real  part  for 
u 

X < XQ  and  some  of  its  spectrum  has  nonpositive  real  part  if  X > XQ,  it 
will  in  general  have  some  spectrum  with  zero  real  part  if  X = XQ  . When 
0 (respectively,  ±i)  is  in  the  spectrum  of  Fu(Xfl,  0)  and  certain  "sim- 
plicity" conditions  are  satisfied,  the  methods  of  Section  3 show  that 
there  are  equilibrium  (respectively,  periodic)  solutions  of  (1.7)  near 
X = x , u = 0 which  can  be  constructively  obtained  and  which  account 
for  the  stability  lost  by  the  trivial  solution  as  X passes  through  XQ  . 

Section  2.  Implicit  Functions  - A Discussion  for  Potential  Users 

The  number  of  methods  which  are  available  to  solve  nonlinear 
problems  is  small,  especially  if  one  refuses  to  regard  a nonconstructive 
• existence  proof  as  a "solution"  of  the  problem  being  treated.  Perhaps 

the  two  methods  of  broadest  scope  and  success  are  perturbation  expan- 
sions and  iterations.  (We  hope  the  reader  has  seen  at  least  one  power 
series  expansion  and  one  Iteration  scheme  somewhere  before.  ) There 
are  three  main  steps  in  trying  either  method  on  a particular  problem.  The 
first  is  to  decide  what  scheme  (i.  e.  form  of  expansion  or  iterations)  to 
try.  Given  a candidate  scheme,  the  next  problem  is  to  show  it  is 
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defined  (i.  e.  the  successive  iterates  or  terms  in  the  expansion  can  be 
found).  Finally,  there  is  the  error  estimation  problem  (i.e. , to  show  how 
close  the  iterates  or  partial  sums  of  the  expansion  are  to  a true  solution). 
All  of  these  steps  may  be  substantial.  It  is  unfortunate  that  abstract 
mathematics  tends  to  ignore  the  first  one.  We  teach  the  contraction 
mapping  principle,  but  do  little  to  explain  how  to  convert  given  problems 
into  contraction  mapping  problems.  One  of  our  main  points  in  this  sec- 
tion is  that  the  implicit  function  theorem  in  Banach  spaces  can  be  regard- 
ed in  this  way;  It  says  "If  you  can  verify  this  condition,  the  following 
iterates  will  work  by  the  contraction  mapping  principle.  " It  contains  in- 
structions on  how  to  convert  problems  into  contraction  mapping  problems. 
Theorems  that  do  this  are  worth  knowing.  Of  course  it  says  much  more. 
One  of  the  other  things  it  says  is  that  if  the  problem  is  analytic  and 
certain  conditions  hold,  then  the  solution  is  analytic.  Hence  perturba- 
tion expansions  will  converge.  It  also  says  that  one  can  compute  the 
expansion  coefficients  by  implicit  differentiations,  which  is  an  efficient 
way  to  keep  track  of  the  algebra.  And  the  proof  of  all  this  is  quite  sim- 
ple. The  plan  below  is  to  first  restate  and  expand  some  of  the  above  re- 
marks in  terms  of  a simple  example  in  an  attempt  to  make  things  seem 
more  concrete.  After  this  we  discuss  differentiation,  compute  some  typi- 
cal derivatives  and  then  informally  state  and  prove  the  implicit  function 
theorem  in  Banach  spaces. 

Say  we  are  lead  to  the  simple  model 

2 2 

u - u = cos  x + cos  x - 1,  °lxlTr» 

u(0)  - u(it)  = 0 , 

while  considering  some  problem.  Someone  cleverly  observes  that  u=  sinx 
is  a solution.  Assume  that  perturbing  the  parameters  which  led  us  to 

(2. 1)  yields 

r 2 2 

U'  - (lt(i)u  = cosx  + cos  x-1  + 6 0<x<ir 

(2.2)  < ~ ~ 

l^u(O)  = u(ir)  = 0 
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and  we  want  to  know  about  "the  solution"  of  (2.  2)  for  (p,  6)  near  (0,  0) 
if  there  is  such  a solution.  The  existence  is  not  entirely  obvious.  For 
example,  if  6 > 2 and  p > -1  then  (2.  2)  implies  u*  > 0 on  0 < x < it 
and  u(0)  = u(tt),  which  is  impossible.  In  fact,  there  are  no  solutions 
in  the  range  6 > j,  p > -1  . 

A standard  way  to  look  for  a solution  is  to  make  the  ansatz 
(2.3)  u = uQ  + p Q + 6U(M  + p2  u2>  Q + p6Ul>  j + 62  uQ>  2 + . . . 


with  uQ  = sinx  . Plugging  (2.  3)  in  (2.2)  and  equating  coefficients  of 
j k 

like  monomials  p 6 on  either  side  leads  to  equations  for  the  u . 

f,  m 

To  first  order  these  are 


ru\  - 2u.  u,  = u2  + f. 

1,0  0 1, 0 0 0 


(2.4) 


u'  , - 2u„ u , = 1 

0, 1 0 0, 1 


0 < X < TT 


0 < x < IT 


2 

where  f = cosx  + cos  x - 1 . Each  of  these  linear  problems  is  to  be 

solved  subject  to  the  requirement  u,  (0)  = u.  (tt)  . This  is  easily 

1 , m I , m 

done  and  u , u can  be  expressed  explicitly  in  terms  of  integrals. 

’ 1>  " 0,  1 

Moreover,  it  is  not  hard  to  see  that  the  remaining  u . can  be  obtained 

i , m 

in  a similar  way  by  solving  problems 


Lu 


= u'  - 2u  u = f 
I , m l,m  0 I , m t , m’ 


(2.  5) 


u#  m(°)  = u(  J")  » 

/ , m l , m 


0 < x < rr  , 


where  f is  a function  involving  f and  the  u.  . for  j + k < i + m , 
which  are  determined  at  previous  steps.  This  method  (a  very  simple  per- 
turbation expansion)  yields  a formal  series  (2.  3)  for  the  solution  of  (2.2). 
If  the  series  converges  well  enough  it  will  certainly  solve  (2.2).  Does 
it  converge?  In  what  region?  Considerable  work  is  Involved  in  studying 
these  questions  directly.  If  the  answers  to  them  are  satisfactory,  we 
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may  want  to  know  more,  for  example  the  parameter  range  In  which  the 
estimate  |u  - (uQ  + p Uj  Q + 6 uQ  | < . 01  for  0 < x < n Is  valid. 

There  are  many  other  possibilities  we  could  try.  For  example,  de- 
fine the  Iterations 

x 

(2.6)  UJ+1(X)  = CJ+1  + / (f0(s)  + 6 + (1+H)  u}(s)  )ds 

where  uQ  = slnx,  cQ  = 0 and  the  c^'s  are  to  be  determined  by  the  con- 
ditions 

TT 

(2.7)  / (f  (s)  + 6 + (l  + p)(u  (s))2)ds  = 0 

0 1 

which  are  necessary  In  order  that  uj+1<0)  = uj+1(u)  • Next  is  neces- 
sary to  see  If  this  Iteration  scheme  Is  defined,  that  Is  If  (2.  7)  can  be 
solved  at  each  stage  In  some  parameter  range.  If  so,  It  still  remains  to 
study  the  convergence. 

Another  Iteration  scheme  can  be  set  up  as  follows.-  Let  L be  the 
operator  corresponding  to  (2.1)  linearized  at  uQ  = slnx,  1.  e. 

I Lu  = u'  - 2UgU 

(2-8)  < 

I u(0)  = u(ir)  . 

We  can  always  uniquely  solve  Lu  = f,  u(0)  = u(ir)  when  f Is  given  on 
[0,  it]  . You  give  the  formula.  (We  do  quickly  note  that  Lu  = 0,  u(0)  = 
u(tr)  implies  u = 0 . Indeed  u(0)  = u(tt)  Implies  u'(xQ)  = 0 for  some 
x0  * (0,  it)  . Then  Lu  = 0 implies  u'(xQ)  - 2(slnxQ)  u(x^)  = 0,  so 
u(xQ)  Is  also  0 . By  uniqueness  of  solutions  of  Initial  value  problems, 
u = 0 . ) Denote  the  solution  of  Lu  = f,  u(0)  = u(ir)  = 0 by  u = Kf  . 

Then  consider  the  Iterations 

(2.9)  uJ+1  = Uj  - K(uj  - (l  + p)Uj2  - fQ  - 6) 

with  uQ  as  before. 
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There  are  several  points  to  be  made.  The  first  Is  that  there  Is  no 
unique  approach  to  (2.  2).  Next  the  program  (2.  3) -(2.  5)  requires  the 
functions  Involved  to  have  derivatives  of  all  orders.  In  this  case  it  will 
converge  In  some  (p,  6)  region  containing  (0,  0),  but  to  prove  It  and 
oet  estimates  would  be  messy.  We  haven't  checked  what  happens  with 
the  scheme  (2.  6) -(2.  7).  The  iteration  scheme  (2.9)  Involves  Inverting 
the  same  linearized  operator  as  (2.  3)-(2.  5).  It  will  converge  In  some 
(p,  6)  region.  The  method  of  (2.  9),  which  we  have  not  yet  Identified, 
does  not  require  the  existence  of  higher  order  derivatives.  So  what  is 
the  "method"?  Iterations  are  used  - but  how  are  these  iterations  chosen? 
What  guarantees  the  convergence?  The  way  in  which  many  Iteration 
schemes  converge  is  well  summarized  by  the  contraction  mapping  princi- 
ple, but  this  principle  does  not  explain  how  to  set  up  an  Iteration  scheme 
involving  contraction  mappings  in  a given  problem.  The  implicit  function 
theorem  in  fact  does.  The  implicit  function  theorem,  as  applied  to  (2.2) 
at  the  known  solution  p = 6 = 0,  uq  = slnx  tells  one  to  use  the  itera- 
tions (2.  9),  that  the  right  hand  side  of  (2.  9)  is  a contraction  mapping  in 
Uj  in  a suitable  metric  space  and  gives  estimates  all  at  once.  More- 
over, (in  appropriate  versions),  it  also  tells  one  how  to  compute  the 
u of  (2.  3)  in  an  efficient  way  and  a sense  in  which  the  series  there 
will  converge. 

In  order  to  use  this  very  efficient  packaging  of  contraction  mappings, 
iteration  schemes  and  simple  perturbation  expansions  by  the  implicit 
function  theorem  one  must  supplement  the  "linearized"  gadgets  of  ap- 
plied mathematics  with  a little  more  information  to  be  able  to  call  them 
Frechet  derivatives.  This  is  not  a big  deal,  since  to  upgrade  a linearized 
gadget  to  a Frechet  derivative  one  must  only  give  adequate  quantitative 
information  about  the  sense  in  which  it  is  a good  linear  approximation. 
When  one  knows  how  to  do  this,  what  the  implicit  function  theorem  says 
and  some  facts  of  linear  analysis,  he  discovers  an  incredible  array  of 
very  useful  results  which  may  be  understood  quickly  in  terms  of  the  im- 
plicit function  theorem.  A few  of  these  relevant  for  this  article  are 
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bifurcation  from  simple  eigenvalues,  the  Liapunov  - Schmidt  procedure, 
perturbation  of  simple  eigenvalues  and  Hopf  bifurcation.  (There  are  many 
more  - Picard  existence  theorem,  conditional  stability,  etc.,  etc.) 
Frechet  derivatives  and  the  implicit  function  theorem  are  explained  be- 
low. 

We  originally  thought  to  begin  with  the  definition  of  a Banach 
space  and  work  as  quickly  as  possible  to  the  Implicit  function  theorem. 
This  Is  a surprisingly  short  program  _ but  it  does  not  seem  optimally 


useful.  The  presentation  In  [21]  is  close  to  this  If  one  reads  it  ignoring 
the  distracting  array  of  definitions  (assume  every  space  In  sight  is  a 
Banach  space,  which  Is  defined  there).  See  also  [9].  (We  like  the  dis- 
cussion in  [1]  even  If  it  Is  not  so  quick. ) What  Is  lacking  In  [21]  Is  ex- 
amples. So  we  have  compromised  by  assuming  the  definition  of  a Banach 
space  and  then  defining  Frechet  derivatives,  discussing  the  calculation 
of  Frechet  derivatives  in  applications,  Informally  stating  the  implicit 
function  theorem  and  giving  its  proof. 

If  f:  1R  -*  1R,  the  derivative  f'  of  f is  the  function  defined  by 


provided  the  limit  exists.  Equivalently,  f = f * (tQ)  provided  that 


(2.11) 


f(tQ  + h)  = f(tQ)  + fh  + o(h) 


where  o(h)  stands  for  any  function  satisfying  lim  o(h)/|h|  = 0 . The 
definition  (2. 10)  is  computational  - it  provides  a way  to  compute  f'  . 
The  definition  (2. 11)  is  often  easier  to  work  with  (e.  g. , in  proving  the 
chain  and  product  rules)  and  introduces  the  idea  that  derivatives  are 
"linear  approximations".  For  example,  the  proof  of  the  chain  rule  reads 
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f(g(tQ  + h))  = f(g(tQ)  + g'(t0)h  + o(h)) 

= f(g(t0))  + f'(g(t0)  (g’(t0)h  + o(h» 

+ o(g'(t0)h  + o(h)) 

= f(g(t0))  + f'(g(t0))  g*(t0)h  + o(h) 

since  f'(g(t  ))o(K)  + o(<g'(t0)h  + o(h))  = o(h)  . 

If  f:  F -*  V where  V Is  a Banach  space,  both  (2.10)  and  (2.11) 
make  perfect  sense  and  remain  equivalent.  The  limit  in  (2.10)  and  o(h) 
in  (2. 11)  now  refer  to  the  way  things  are  measured  by  the  norm  ||  ||  of 
V . If  f*.  U -*  V where  U and  V are  Banach  spaces  we  cannot  write 
(2. 10)  directly.  A natural  generalization  is  the  notion  of  directional  de- 
rivative. Define  the  derivative  of  f at  uQ  <=  U in  the  direction  u c U 
by 

A f(u  +hu)-f(u) 

(2.12)  f(u  ; u)  = 11m  }j 

h«  F 
h— 0 

provided  the  limit  exists.  This  notion  is  used  in  defining  "Gateaux  de- 
rivatives". It  has  the  weakness  that  it  does  not  involve  the  way  things 
are  measured  in  U . The  limit  on  the  right-hand  side  is  taken  in  V , so 
||  ||  is  involved,  but  ||  ||  is  not.  The  natural  generalization  of  (2. 11) 
involves  both  ||  ||  and  ||  Hv  . 

Definition  . A continuous  linear  mapping  T:  U -*  V is  the  Frechet  de- 
rivative of  the  (nonlinear)  function  fs  U -*  V at  uQ  « U if  for  every 
e > 0 there  is  a 6 > 0 such  that  u « U and  II u || ^ < 6 Implies 
II f(uQ  + u)  - (f(uQ)  + Tu) || v < ellully  . 

Be  patient  - the  definition  will  gradually  become  clearer  and  more 

* 

transparent.  We  summarize  it  by  saying  that  T is  the  Frechet  deriva- 
tive of  f at  uQ  provided  that  f(uQ  + u)  = f(uQ)  + Tu  + o(u)  . One  says 

"f'(u  ) exists"  or  "f  is  Frechet  differentiable  at  u " if  there  is  such 
0 0 
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a T and  writes  T = f ’ (UQ)»  f'  is  the  mapping  uQ  ■*  f'(uQ)  defined  on 
the  set  of  uQ  a U such  that  f ’ (uQ)  exists.  The  values  of  f'  are  con- 
tinuous linear  mappings.  Implicit  in  these  remarks  is  the  easily  checked 
fact  that  f can  have  at  most  one  Frechet  derivative  at  any  u^  . In  fact, 
clearly  f’(u  )u  = f'(u0  ;u ) so  f ' ( UQ)  can  be  computed  by  (2. 12)  provided 
f'(u0)  exists. 

Now  we  consider  the  computation  of  derivatives.  In  applications 
in  analysis  there  are  two  common  types  of  things  v-e  need  to  differentiate. 
These  are  linear  mappings  and  substitution  operators.  The  differentia- 
tion of  continuous  linear  mappings  is  easy.  If  T:  U -*  V is  continuous 
and  linear,  then  T(uQ  + u)  = TuQ  + Tu , so  T'(u  ) = T for  every  uQ  . 

The  Frechet  derivative  of  a continuous  linear  mapping  is  itself  at  each 
point.  This  needs  to  be  reconciled  with  the  one  dimensional  case.  If 
f(t)  = 2t  for  t « IR  we  usually  write  f‘  = 2 . f'  is  the  constant  function 
2,  which  does  not  seem  to  be  f . But  the  derivative  of  a continuous 
linear  function  is  itself  - what  is  going  on  ? Go  back  to  (2. 10)  and  (2. 11). 
The  Frechet  derivative  of  f(t)  = 2t  at  t is  the  linear  mapping 
h -*  f'(tQ)h  = 2h,  which  is  f at  each  t . More  generally,  if  f;  ]R  -*  V 
then  f‘(t  ) can  be  thought  of  as  an  element  of  V computed  via  (2.10) 
or  as  the  linear  mapping  h -*  hf' (tQ)  defined  by  this  element  of  V . As 
a V-valued  function  f can  be  constant  even  if  its  values  regarded  as 
linear  mappings  are  not  constant  mappings.  Both  ways  of  thinking  are 
useful  in  this  special  case  (i.  e.  f'  has  values  in  V or  in  the  contin- 
uous linear  mappings  from  IR  into  V)  and  we  use  them  interchangeably. 

Let  us  go  on  to  substitution  operators.  Assume  now  that 
g:  lRn  — ]Rm  . Given  a set  ft  and  a function  u;  ft  -*  lRn  we  define  a new 
function  G(u):  ft  -»>  lRm  by 

(2.13)  G(u)(x)  = g(u(x))  for  x«  ft  . 

G is  a substitution  operator.  Assume  the  first  partial  derivatives  of  g 

0 

are  continuous  and  let  g (u)  denote  the  Jacobian  matrix  (- — g (u  , . . . , 

u ^ — u . 1 t 

u^))  evaluated  at  u = (u^, . . . , u ) t IR  . By  calculus  (Taylor's  theorem), 
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* n 

If  u,  u:  J2-*  1R  and  x«  ft  are  fixed,  then 

1 A 

lim  — (G(u  + hu  )(x)  - G(u)(x)) 
h-0  h 

= llm  q(u(x)  t-hu(x))  - g(u(x)) 
h-*0  h 

= 9u(u(x))  u(x)  . 

Thus  we  expect  the  Frechet  derivative  of  G at  some  uQ  to  be  the  linear 
operator  "multiply  u(x)  by  gu(u0(x))"  . That  is,  we  expect 

(2.14)  (G'(u0)u)(x)  = gu(uQ(x))  u(x)  . 

Now  is  the  time  to  make  the  following  point:  There  is  no  doubt  that  (2. 14) 
is  the  right  way  to  linearize  G at  uQ  . There  is  almost  never  any  doubt 
about  what  the  "linearized  operator"  should  be.  The  formulas  for  "lin- 
earized operators"  do  not  depend  on  functional  a sis  or  what  Banach 
spaces  are  involved  at  all.  One  finds  the  formulas  easily.  But  to  say  a 
linearized  operator  is  a Frechet  derivative  is  a statement  about  how 
small  errors  are,  and  the  usual  ways  of  measuring  errors  introduce  the 
norms  of  Banach  spaces.  To  claim  G'(uQ)  exists  and  is  given  by  (2.14) 
we  must  say  what  the  spaces  are  and  verify  estimates.  Let  ft  be  a 
closed  bounded  subset  of  some  Euclidean  space.  Let  C(ft:  1R  ) denote 
the  Banach  space  of  continuous  functions  u:  ft  ]R  equipped  with  the 
norm 

llu||  = sup  II  u(x)  ||  , . 

C(fts]R  ) xtft  J? 

Regard  G as  a map  from  C(D:lRn)  to  C(fi:IRm)  . If  the  first  partials  of 
g are  continuous,  then  G'  i£  given  by  (2.14).  To  prove  this  we  must 
show  that  given  e > 0 there  is  a 6 > 0 such  that 


16 


M.  G.  CRANDALL 


sup  ||g(uQ(x)  + G(x))  - (g(uQ(x))  + gu(uQ(x))  u(x))||  m 
xe  J2  ]R 

< e sup  It  u(x)  ||  n 
x«  S2  ]R 

provided  sup  ||u(x)||  ^ < 6 . Since  uQ(x)  ranges  over  a compact  sub- 
xc  ft  ]R 

set  of  lRn  as  x ranges  over  S2,  this  follows  easily  from  Taylor's 
theorem. 

For  example,  if  G : C([0, 1]:  ]R2)  -*  C([0, 1]:1R)  is  given  by 
G(u1,u2)(x)  = Uj(x)  u2(x)  + sin  ut(x),  then  (G'fu^,  uQ2)u  ) (x)  = 

U01(X)  ^2(X)  + *VX)  U02(X)  + (cosu1(x))  where  uQ  = (u  uQ2)  and 

u = (Gj,  u?)  are  in  C([0, 1]:  IR2)  . (Exercise;  differentiate  G(u)(x)  = 
g(x,  u (x)),  g;  n X IRn  -*  IRm)  . 

Working  in  other  spaces  be  more  subtle.  For  example,  if 
G;  L2(0, 1)  — L2(0, 1)  is  given  by  G(u)(x)  = sinu(x),  then  G’(uQ)  ought 
to  be  given  by  (G'(u0)u)(x)  = (cosuQ(x))  u(x)  . However,  even  though 
u — (cosuQ)u  is  a bounded  linear  operator  on  L2(0, 1),  G'(uq)  does 
not  exist.  We  cannot  guarantee  that 

1 2 2 ^ 2 

f (sin(uQ(x)  + u(x))  -(sinu0(x)+(cosu0(x))u(x))  dx  < e /( u))Zdx 

1 

/•a  2 2 

by  requiring  ( u(x)  dx  < 6 no  matter  how  small  6 > 0 is  nor  what  u 
J0  __  0 
is.  Thus  the  linearized  operators  of  applications  usually  define  Frechet 

derivatives  in  some  spaces  and  not  in  others.  Further  information  on 

substitution  operators,  etc.,  may  be  found  in  [2  3]. 

Consider  now  the  mapping  K : C([0, 1];  1R)  -*  C([0, 1];  IR)  given  by 

X 2 

(2.  IS)  K(u)(x)  = J sin(u(s  ))ds  . 

0 

K is  built  up  from  three  mappings  as  follows;  If 

2 rX 

H(u)(x)  = u(x  ),  G(u)(x)  = sinu(x),  F(u)(x)  = J u(s)ds 

0 

then  K(u)  = F(G(H(u)))  . We  have  two  linear  and  continuous  mappings 
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(F  and  H)  and  the  substitution  operator  G . Their  derivatives  (as  map- 
pings of  C( [0, 1 ]:  ]R)  into  itself)  are  known.  The  derivative  of  K can 
therefore  be  computed  by  the  chain  rule  which  says:  If  A:  U -*  V is 
Frechet  differentiable  at  uQ  and  B:  V •»  Z is  Frechet  differentiable  at 
A(uQ),  then  C(u)  = B(A(u))  is  Frechet  differentiable  at  uQ  and  C‘(u0)  = 
B'(A(Uq))  A‘ (uq)  • The  proof  is  exactly  the  same  as  in  the  scalar  case  we 
reviewed  above.  Applying  this  twice  to  compute  K'  we  find 

K'(uq)  = F‘(G(H(u0)))  G‘(H(u0))  H'(uq)  . 

Recalling  what  the  derivatives  of  F,  G,  H are  yields  the  formula 

X 2 a 2 

(K'(u0)u)(x)  = J (cosuQ(s  ))  u(s  )ds  . 

While  this  formula  is  no  surprise,  we  now  know  that  this  is  a Frechet 
derivative.  Adding  to  our  tool  kit  (presently  consisting  of  differentiation 
formulas  for  continuous  linear  mappings,  substitution  operators  and  the 
chain  rule)  the  obvious  formulas  for  derivatives  of  sums  and  linear  com- 
binations, we  can  handle  most  problems  one  encounters. 

We  will  use  the  notation  B(U;  V)  for  the  Banach  space  of  continu- 
ous and  linear  mappings  T:  U -*  V equipped  with  the  norm  ||t||  = 
sup{||Tu||v:  u « U and  ||u||  < 1}  . The  principle  estimate  in  the  proof 

of  the  implicit  function  theorem  simply  says 

(2.16)  ||  F(uJ)  - F(u2)  || v < sup  ||F'(u][  + t(u2-u1))||  H uj  - u2  || y 

whenever  F;  U -*  V and  the  indicated  quantities  exist.  Thus  bounds  on 
i derivatives  are  Lipschitz  constants.  The  proof  of  (2.16)  is  sketched  later. 

Let  A,  U,  V be  Banach  spaces  and  F:AXU-*V.  Let  (\Q,  uq)  € A X U 
and  consider  the  problem 

(2.17)  F(\,u)  = F(\0,u0)  . 


Assume  that  F is  continuous  and  that  for  X € A the  mapping  u -*F(X,  u) 


18 


M.  G.  CRANDALL 


i 


V 


* 

is  Frechet  differentiable  with  the  derivative  denoted  by  F (X.,  u)  . 

u ’ 

Assume  further  that  (X,  u)  -*  F (X,  u)  is  continuous  and  F (X„,u„)  e 
' ’ ' u'  j u 0’  O' 

B(U,  V)  has  an  inverse  FJXg,  uq)  e B(V,  U)  . The  implicit  function 
theorem  (stated  informally)  says  that  under  these  assumptions  (2.17)  has 
a unique  solution  u(X)  near  uQ  for  each  X near  XQ  . Moreover  u(X) 
is  the  uniform  limit  of  {u^(X)}  where  uQ(X)  = uQ  and 

(2-18)  Vi(x)  = VK)  ■ (Fu(xo’uo))1  (F(K'Uk(M)  ■ F(Vuo))  • 

The  proof  (given  (2.16))  is  nearly  trivial.  F( X,  u)  = F(X0,uQ)  is,  under 
the  current  assumptions,  equivalent  to 


(2.19)  u = u - (Fu(XQ,  u0))’1  ( F( X,  u)  - F(X0,uq))  = T(X,u) 
where  (2.19)  is  the  definition  of  T(X,  u)  . Now 


(i) 


T«V“ol  = uo  • 


(2.20)  ^ (ii)  T(X,  u)  and  T (X,  u)  are  continuous  in  (X,  u)  , 

(HD  Tu(Xo,uo)  = 0U. 


By  (2.  20)  (ii)  and  (iii),  there  is  an  rQ  > 0 such  that  if  ||  X — X ||  , 

II  u - u0  II  u < r0>  tflen  llTu(X,  u)||<  i (the  choice  of  \ is  arbitrary,  any 
c < 1 is  all  right).  By  (2.16),  a bound  on  ||T  ||  is  a Lipschitz  constant 
and  so 


I T(  X,  Uj)  - T(X,uz)||u  < } ||  Ul  - u2 


U 


(2.  21) 


if  IU-  VA»  Ui  ’ UQ  U — f0  f°r  1 = 1’2 


One  easily  checks  that  ||t(X,'u)  - u ||  < r if  ||u-u0lly<r  provided 
that  r < rQ,  II  A - XQ  ||A  < rQ  and  2 ||  T(X,  u ) - uQ  ||  y < r . Thus  the  con- 
traction mapping  principle  guarantees  the  convergence  of  {u^(X)}  uni- 
formly on  { X:  ||  X - X_  II  . < r.  and  2 ||  T( X,  u_ ) - u_  ||  < r } . (If  you  do  not 

know  the  contraction  mapping  principle  you  should  learn  it  as  soon  as 
possible.)  Everything  follows  at  once. 


I 


I 
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We  have  not  yet  really  used  that  the  parameter  X lies  in  a Banach 
space.  When  it  does  and  F is  continuous,  differentiating  (2.18)  im- 

K 

plies 

(2.21)  u^+1(X)  = Bk(\)  uj^(X)  + A^X) 

where  * . 


(2.  22) 


Bk|X)  = (i  - lyvV1"1 


Ak(M=  -(Fu(VU0»'‘rx<X'\(X»  • 


Iterating  (2.  21)  yields 


(2.  23) 


u,1  = B,  B,  ....  B,A„  t B.  B B2 


k+1  k k-1 


1 0 k k-1 
3k  \-l 


+ . . . + B,  A,  , + A^ 


It  is  then  easy  to  show,  using  the  uniform  convergence  of  u^,  (2.22) 

and  (2.23)  that  as  k u|_(X)  converges  uniformly  in  a neighborhood 
of  X^  to 


00 

Yj  B(Mn  A(X)  = (I  - B(X)f 1 A(\) 
n=0 

= -F(\,u(M)'1  F (X,  u(X)) 

U K 

where  B = lim  B , A = lim  A . Thus  u(X)  is  continuously  differenti- 
k— 00  k k— <*>  -1 

able  and  its  derivative  is  -F  (\,  u( X.))  F (X,  u(X)),  which  is  what  one 

u X 

finds  by  differentiating  F(X,  u(X))  = F(XQ,  uQ)  and  solving  for  u’(X)  . If 
F has  higher  derivatives,  so  will  u and  this  can  be  proved  by  differen- 
tiating (2.18)  repeatedly  and  showing  the  resulting  expressions  converge. 
This  is  an  Important  point:  If  F has  continuous  derivatives  with  respect 
to  (X,  u)  of  orders  up  through  m inclusively,  then  sequence  obtained  by 
differentiating  {u^( X) } at  most  m-tlmes  converges  in  the  same  neighbor- 
hood of  XQ  that  we  showed  (u^(X)}  converged  in. 
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Finally,  if  F is  analytic  in  its  arguments  (which  we  haven't  and 
won't  define),  then  so  is  each  . Moreover,  everything  in  sight  can 
be  expanded  in  power  series  and  can  therefore  be  defined  on  the  com- 
plexificatlons  of  A,U,  V (if  the  scalar  field  was  (E  to  start  with,  this 
isn't  necessary).  By  the  above  proof  (which  works  over  the  reals  or  com- 
plexes) u(A.)  will  be  C1  from  complexified  A to  complexified  U , 
hence  analytic.  Another  argument  goes:  {u^( X) } converges  uniformly  to 
u(\)  in  a complexified  neighborhood  of  \Q,  so  u(\)  is  analytic. 

Further  details  about  analyticity  can  be  found  in,  e.  g. , [10]  and 
a good  source  for  higher  derivatives,  applications,  etc.  is  [1],  The 
article  [15]  is  also  useful  reading.  (The  author  of  [15]  expresses  the 
pleasure  of  realizing  how  neat  the  implicit  function  theorem  is.  The 
notion  he  touts  there  is  a perturbation  of  old  ones.  See  e.  g. , [8, 

Theorem  3]  and  [1,  Prop.  4.  3. 1]. ) 

Consider  (2.  2)  again.  Let  U be  the  Banach  space  of  once  con- 
tinuously differentiable  functions  u;  [0,  n]  — 1R  such  that  u (0)  = u(n) 
equipped  with  the  norm 

||u||  = max  (|u(x)|  + |u'(x)|)  . 

*[0,1] 


Let  A = 1R  and  F;  A X U -*•  V = C([0,  it]:  IR)  be  defined  by  F(p,  6,  u) 
2 2 

u'  - (l+p)u  - (cosx  + cos  x - 1 + 6)  . F and  F are  continuous  and 

u 

is  given  by 


F 

u 


Fu(p,  6,  u)u  = u'  - 2(1  + p)u  u . 


Indeed,  U3  u -*  u'  e V is  linear  and  continuous.  (sup|u'  | = ||u'  ||  < 

sup(|u|  + | u ' | ) = ||u||^)  while  u -*  -(1  f p)u2  is  a substitution  operator. 
(We  did  not  explicitly  differentiate  the  mapping  u — u2  with  this  domain 
U before,  but  one  can  think  of  it  as  the  composition  u -*  u -*  u from 
U to  V to  V . The  first  linear  map  is  continuous  and  u — u from  V 
to  V we  did  differentiate  above. ) You  check  that  F is  continuous.  We 
note  that 
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II  FU(K»  6>  u)^  - Fu(P»S,u)u  ||  v = ||  Z[(1+h)u  - (l+|I)u]u  ||  v 

< 2(  I F-F I II  u ||  y + | (1+|I)  | ||  u - u ||  u)  II  u ||  y 

which  implies  that  F is  continuous.  If  u„(x)  = sinx,  F(0,  0.  u„)  = 0 
u 0 ’ ’ 0 

while  F^fO,  0,  uQ)  is  nonsingular  (i.  e.  has  a continuous  inverse).  The 
implicit  function  theorem  applied  here  says  that  (2.9)  converges  and  the 
resulting  function  u(p,  6)  is  analytic  (with  values  in  C*([0,  tt]))  in 
(p,  6)  near  (0,  0)  . Thus  (2.  3)-(2.  5)  also  converges.  In  fact  the  u 

Jy  K 

can  be  obtained  by  successive  implicit  differentiations  of  (2.  2)  with  re- 
spect to  p and  6 . This  leads  again  to  (2.  5),  but  the  computations  are 
easier.  What  about  the  region  of  convergence  of  (2.  3)?  Perhaps  this  can 
be  estimated  explicitly  here,  owing  to  the  simplicity  of  (2.  2),  but  usually 
it  is  too  difficult  to  bother  with.  By  contrast,  it  would  not  be  hard  to 
keep  track  of  the  contraction  constants  in  (2.  9)  and  obtain  explicit  esti- 
mates as  outlined  in  the  proof  of  the  implicit  function  theorem. 

We  close  this  section  with  some  remarks  about  verifying  the  hy- 
potheses of  the  implicit  function  theorem  and  then  the  proof  of  (2.16). 

First  of  all,  if  U and  V are  concrete  Banach  spaces  (e.  g.  C(U:  IRn)  , 

2 

L (0, 1),  etc.  ) and  T:  U — V is  a concrete  linear  operator  defined  on  all 
of  U (that  is,  explicit  instructions  for  computing  Tu  given  u are 
available),  then  T is  continuous.  You  cannot  "write  down"  a discon- 
tinuous linear  operator  T:  U -*  V,  so  it  is  unnecessary  to  worry  about 
continuity  of  linear  operators  of  this  sort  in  applications.  Similarly,  if 
T;  U — V is  a concrete  linear  operator,  then  T : V -*  U will  exist  and 
be  continuous  when  T is  one-to-one  and  onto  V . Thus  to  use  the  im- 
plicit function  theorem  it  is  only  necessary  to  show  F^(\,  u)  exists  (i.  e. , 
is  a Frechet  derivative),  depends  continuously  on  (\,  u)  and  that 
^u^0’U0^  iS  one'to*one  and  °nto-  Next,  it  is  easy  to  "generalize"  the 
implicit  function  theorem.  One  doesn't  need  F (\,u)  to  exist,  etc.  It 
is  only  necessary  to  find  a linear  mapping  A such  that  contraction  map- 
ping theorem  can  be  used  to  solve  u = u - A(F(\,  u)  - F( \Q,  uq))  • But 


22 


M.  G.  CRANDALL 


this  sort  of  generalization  is  not  very  worthwhile  and  messes  up  any 
presentation  unless  the  problem  being  discussed  really  justifies  it. 
Otherwise  do  the  smooth  case  - we  all  know  now  how  to  generalize  it. 
(Again,  [8,  Thm.  3]  and  [15]  are  relevant.) 

Proof  of  (2.  lfe).  Define  g(t,  t)  = ||  F(Uj  + t(u2  - Uj))  - F(u1  + t(u2  - UjJ)  ||  . 
If  ||  F‘(Uj  + t(u2  - Uj))  II  < M for  t c [0, 1],  we  need  to  show  g(l,  0)  < 
m||u2  - ujllM  • Assume  u2  ^ uj  • The  triangle  inequality  yields 

g(t  + h,  0)  < g(t,  0)  + g(t  + h,  t) 
for  0 < t < t + h < 1,  while  the  definition  of  F’  implies 

g(t  + h,  t)  = II  F^Uj  + t(u2  -u1))h(u2  -u^  + o(h)||v  < mIIuj  - u2  II y h 

if  h > 0 is  sufficiently  small.  Using  these  facts  it  follows  easily  that 
the  set  ft  = (t  € [0,1]:  g(t,  0)  < tM  ||  u2  - ujl^}  contains  0,  contains 
t + h if  t € $ (1  [0,1)  and  h > 0 is  sufficiently  small.  Thus  max  $ = 1 , 
and  we  are  done. 

Section  3.  Implicit  Functions  and  Bifurcation 

We  rather  briefly  cover  some  of  the  types  of  results  discussed  in 
[3],  [4]  and  [5],  First,  curves  of  nontrivial  solutions  bifurcating  from 
particular  trivial  solutions  are  constructed  in  certain  situations.  A 
simple  application  to  an  equilibrium  problem  is  presented  and  the  appli- 
cation to  Hopf  bifurcation  is  very  quickly  sketched.  Then  the  stability 
of  equilibrium  solutions  obtained  this  way  is  treated  in  the  finite  dimen- 
sional setting.  Analogous  stability  results  for  the  Hopf  bifurcation  are 
reviewed  in  [6]  and  proved  In  [5],  which  covers  the  infinite  dimensional 
case  as  well.  Finally,  the  Llapunov-Schmidt  procedure  is  recalled. 

Constructing  Bifurcating  Curves 

Let  F*  A X U -*  V where  A,U,V  are  Banach  spaces.  The  dis- 
cussion of  Section  1 leads  us  to  study  the  problem  F(\,  u)  = 0 when  there 
are  known  trivial  solutions  u = 0 . A point  (X  , 0)  « A XU  is  a 
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bifurcation  point  for  F = 0 (with  respect  to  the  trivial  solutions)  if  every 
neighborhood  of  (XQ,  0)  in  A X U contains  nontrivial  solutions  of  F = 0. 
Assume  all  derivatives  appearing  below  depend  continuously  on  their 
arguments.  If  Fy(X  , 0)  is  nonsingular  (1.  e.  has  a continuous  inverse), 
then  F( X,  u)  = 0 is  uniquely  soluable  near  (XQ,  0)  for  u = u(X)  . Since 
u(X)  = 0 works,  the  uniqueness  implies  that  (XQ,  0)  is  not  a bifurcation 
point.  If  Fu(XQ,  0)  is  singular,  anything  can  happen.  In  certain  simple 
situations  additional  curves  of  nontrivial  solutions  of  F = 0 which  pass 
through  (XQ,  0)  can  be  constructed.  The  proof  of  this  is  sketched  below. 
Since  F(  X,  0)  s 0,  given  XQ  c A F can  be  written  in  the  form 

(3.1)  F(X,  u)  = Lu  + B( X - XQ,  u)  + p(\,u) 

where  L = B:  A X U -»  V is  a continuous  bilinear  mapping  and 

p satisfies 

r (i)  P( x,  o)  = o 

(3.2)  < 

L<ii)  p(v°)  = °,  p'(v°)  = 0 and  pxu(Ko' 0)  = 0 • 

In  (3.1),  B is  the  second  mixed  partial  derivative  fXu(^qi°)  interpreted 
as  a bilinear  form.  (To  compute  B(X,  u)  differentiate  X — F^( X,  0)u  at 
X0  . Fu(Xq  + X , 0)u  = Fu(X0,0)G  + B( X,  u)  + o(X)  for  fixed  u . In  a simi- 
lar way,  all  higher  derivatives  may  be  regarded  as  multilinear  mappings. ) 

2 

Theorem  1.  Let  F be  of  class  C (i.e. , have  two  continuous  deriva- 
tives), y A and  (3.1),  (3.  2)  hold.  Assume;  uQ  € U,  uQ  * 0,  LuQ  = 0 , 
Z is  a closed  subspace  of  U and  that  every  v c V can  be  uniquely 
written  in  the  form  v = Lz  + B(X,  uQ)  for  some  z « Z and  X « A . Then 
there  are  continuously  differentiable  functions  Xs  (-a,  a)  -*  A,  4<;( -a,  a)  — Z 
defined  on  some  interval  (-a,  a)  such  that  X(0)  = XQ,  ip(0)  = 0,  and 
F(X(s),  s(uQ  + 4j(  s ) ) ) = 0 for  sc  (-a,  a)  . Moreover,  all  the  solutions  of 
F = 0 in  AX  {span{uQ}  + Z}  near  (^Q>0)  have  the  form  (X(s),  s(uQ  + 
4i(s)))  for  some  sc  (-a,  a). 
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Sketch  of  Proof;  In  the  theorem,  span  {uQ}  + Z denotes  the  set  of  vec- 
tors au^  + z with  a a scalar  and  z t Z . Define  G:  K XA  XZ  - V 
by  G(s,  (3,  z)  = s F((3,  s(uQ  + z))  if  s * 0 and  extend  by  continuity  to 
s = 0 . Assume  first  that  p s 0 in  (3. 1).  Then  G(s,  X,  z)  = L(uQ  + z)  + 

B(\  - XQ,  uQ  + z)  is  independent  of  s . Now  G(0,  XQ,  0)  = Luq  = 0 and 
the  derivative  of  (\,  z)  — G(s,  X,  z)  at  (0,  X , 0)  is 

(3.  3)  (X , z ) - Lz  + B( X , u ) . 

This  mapping  is  nonsingular  by  assumption.  Hence  the  implicit  function 
theorem  implies  that  G = 0 is  solvable  for  (X,  z)  in  terms  of  s near 
(0,  XQ,  0)  . Say  X = X(s),  z = 4>(s)  is  the  solution.  (When  p s 0 , 

Ms)  = Xq,  4j(s)  = 0 works.  ) In  the  general  case,  all  the  arguments  just 
made  apply.  (3.2)  guarantees  that  (3.  3)  is  still  the  Frechet  derivative  of 
G with  respect  to  (X,  z)  at  (0,  X , 0)  . This  presentation  emphasizes 
that  the  solutions  of  F = 0 we  have  just  found  can  be  thought  of  as  aris- 
ing from  "bending"  the  curve  X = XQ,  u = suQ  associated  with  the  linear 
problem  appropriately  near  s = 0 . 

Remark:  If  A,  U,  V are  complex  Banach  spaces,  s can  be  a complex 
parameter  and  (-a,  a)  replaced  by  |s|<  a . 

The  uniqueness  assertion  of  the  theorem  follows  from  the  unique- 
ness in  the  implicit  function  theorem  and  an  additional  estimate.  More 
precisely,  let  X = X(s),  z = 4»(s)  be  the  solution  of  G = 0 Just  ob- 
tained. If  the  uniqueness  of  solutions  of  G = 0 holds  in  the  region 
II  X - Xq  ||^  < r,  ||  z ||  y < r,  | s | < r,  then  any  solution  of  F(  X,  suQ  + w)  = 0 
with  w € Z must  satisfy  X = X(s),  w = si)j(s)  provided  that  || w || u < 

I s | r,  |s  | < r and  ||  X - X ||  < r . Now  if  Xe  A,  w c Z there  is  a 
c > 0 such  that 

(3.4)  Lw  + B(X  - XQ,  uQ)  = v implies  ||w Hy  + ||  X - XQ  ||A  < c II  v|| 

because  (3.  3)  is  nonsingular.  Moreover,  given  e > 0 (3.2)  implies 
there  is  a 6 > 0 such  that 
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fllwlly,  I s | , ||\-\0IIa<6  Implies  II  P(  X,  suQ  +w)  Hv  < 

(3.5)  1 

e(  I s I + || wily)  . 

This  follows  from  (3.  3),  (2. 16)  and  p(X,  suQ  + w)  = p(\,  suQ  + w)  - p(X,  0)  . 
Rewriting  F(X,  suQ  + w)  = 0 as 

Lw  + B(s(X-  X ),  uQ)  = -B(X-  XQ,  w)  + p(X,  suQ  + w) 

and  using  (3.4),  (3.  5)  yields 

(3.6)  II  w ||  y + | s|  II  \ - XQIIA  < c(  ||  B ||  II  X-  XQIIA  II  w II  y 

+ e(  I s | + II w ||  y)) 

where  ||  B ||  = sup  { ||  B(t , u)  II : II  X II  A>  II  u ||  y < 1 } . Assuming 
c||b||  ||\- \ II  + ce  < £,  (3.6)  implies  ||  w ||  y + | s | ||  X - XQ  II A < 2ce  I s | 
But  e > 0 is  at  our  disposal,  and  the  local  uniqueness  follows. 

Example  1.  Let  A = 1R,  U = {u  e C2([0,7r]);  u'(0)  = u'(ir)  = 0} 
under  the  norm 

II  u ||  = max  (|u(x)|  + | u*(>c)  | + |u"(x)|)  , 

U 0<X<TT 

2 

V = C([0,ir])  with  the  usual  norm  and  F(  X,  u)  = u"  + Xu  - u (u+X)  . (Via 
the  discussion  following  (1.  8)  the  problem  F = 0 corresponds  to  equilib 
rlum  solutions  of  (1.2).)  Now  F^ ( X.,  0)  u = u " + Xu  . The  problem 
u " + Xu  = v,  u'(0)  = u'(ir)  = 0,  always  has  a unique  solution  unless 

7 

\ = n for  n = 0, 1,  . . . , in  which  case  it  is  solvable  exactly  when 

IT 

J v(x)  cosnxdx  = 0 . Therefore  the  possible  bifurcation  points  corre- 

0 2 

spond  to  \ = 0, 1,  4,  . . . . Let  = n . The  decomposition  (3. 1)  hold 

with  Lu  = u"  + n2u,  B(X,  u)  = Xu  . Set  u (x)  = cosnx  and  let  Z be  a 

u 2^ 

subspace  of  U . We  want  the  problem  Lz  + B(X,  u^)  = z"  + n z + 

A A 

X cosnx  = v to  be  uniquely  solvable  for  z,  X . Choosing 
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Z = {u  « U : J u(x)  u (x)dx  = 0},  this  Is  the  case.  Indeed, 

a 2 0 a 1 

\ - J (cosnx)  v(x)dx  (if  n = 1,  2, . . . ) or  X = — J v(x)dx  if  n = 0 , 

and  you  can  work  out  the  formula  for  z . Let  Lz  + B(  £ , uQ)  = v have 
the  solution  z = KjV,  X = K2v  . To  solve  G = 0 the  implicit  function 
theorem  offers  the  prescription:  = z^  - KjG(s,  X^,  z^),  X^  = \ ' 

G(s,  X^,  z^)  with  zQ  =0,  XQ  = XQ  . Expressed  in  terms  of 

uk  = s(uo  + Zk)  thls  scheme  is  uk+1  = uk  - Ki  F<  VV’  3 = a\  - 

K2F(W  or 

TT  2 j TT  2 

s Xkkl  3 SXk  - </0u 0dx)'  /0(uk  + Xkuk  - Vuk  x x,uodx 

U^1  + n2  uwi  = (°2  - vuk  + uk<uk  + K)  - *(K±i  - VK 


k+1  V 0 


Uk+1(0)  = Uk+1(,T)  = ° 


Vi  uo  dx  = s / uo  dx  • 

Ao  an  exercise  compute  a few  of  the  derivatives  of  the  solution 

(X,  u)  with  respect  to  s at  s = 0 . Observe  also  that  the  entire  analysis 
2 

is  correct  if  the  "u  (u  + X)"  term  is  replaced  by  anything  suitably  small 

3 2.  4 n 2 

at  u = 0 . For  example  xu(x)J  + sin(u'(x))Z  + (u"(ir/2))4  + f (u(.7-r\)ZdT 

is  ouite  acceptable.  0 

Example  2.  (Hopf  Bifurcation) 

Consider  the  problem 

(3.7)  ^ + f(P,u)  = 0 . 

(We  have  in  mind  (1.  7),  but  have  changed  notation  for  convenience. ) 
Assume  f(p,  0)  = 0 and  let  ±1  be  simple  eigenvalues  of  the  Jacobian 
matrix  f^p^,  0)  • Assuming  a little  more  we  find  nontrivial  periodic 
solutions  of  (3.7)  near  p = pQ,  u = 0 . If  pt  = t,  (3.7)  becomes 

(3.  8)  ^ + pf(p,  u)  = 0 
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and  we  look  for  2tt- periodic  solutions  of  (3.  8)  (or  2irp- periodic  solutions 

of  (3.  7)).  Let  U = {u:  1R  -*  IR0:  u is  once  continuously  differentiable 

and  2ir- periodic },  V = {v*  ]R  -*•  1R n : v is  continuous  and  2ir-periodic  } 

2 

with  the  usual  norms.  Let  A = IR  = {\  = (ft  p):  ft  p * 1R  } . Define 
(3.9)  F(\,u)  = ^ + pf(p,u)  . 

Let  Lq,  L1  be  the  matrices  Fu(Ft0»  °)  and  F (pQ,  0)  respectively.  Then 
L and  B in  (3. 1)  are 

du 

LU  = d7  + Lou’  B(x» u)  = pLou  + kLiu  ’ 

••It  “It 

where  X = (p,p)  . Let  LQa  = ia  and  uq  = Re  e a,  u^  = Im  e a . 

2tt 

Choose  Z = {u  t U:  J (u(t),  u (t))  dT  = 0 for  i = 0, 1 } . The  map- 

0 IR 

A A A /V 

ping  (X,  z)  -*  Lz  + B(X,  u^)  is  one-to-one  and  onto  exactly  when 


(i)  ki  is  not  an  eigenvalue  of  LQ  for  k = 0,  2,  . . . 
(3.  I0)^(ii)  If  (3([i)  is  the  eigenvalue  of  LQ  + p.L^  near  1 
for  p near  then  Re  P‘(p0)  * 0 . 


The  assumptions  (3.10)  were  made  by  Hopf  and  the  current  proof  simpli- 
fies and  generalizes  his.  It  is  an  interesting  (and  somewhat  extended) 
exercise  to  verify  the  preceding  assertion  relating  Hopf's  assumptions  to 
the  mapping  (£ , z)  — Lz  + B(X,  uQ)  . See  [5]  for  details. 

Simple  Eigenvalues  and  Stability  in 

Consider  (1.  7)  with  A = IR,  U = lRn  . Now  F:  IR  X lRn  -*  lRn  . Let 

us  consider  the  conditions  under  which  Theorem  1 applies  at  some  X « ]R . 

0 

Let  Lq,  Lj  be  the  matrices  LQ  = Fy( \Q,  0),  Lj  = F^u( XQ,  0)  . Then  L and 
B in  (3. 1)  are  given  by 


(3.11) 


r Lu  = L0u 

V B( X — XQ,  u)  = ( X - XQ)  Lj  u . 
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Since  L is  to  be  singular,  there  is  a uQ  * 0 such  that  Lquq  = 0 . 

When  can  we  choose  a subspace  Z of  IRn  so  that 

(3.  12)  IR  X Z 3 (£,  z ) - LQz  + \LjU  € lRn 

is  one-to-one  and  onto  Fn  ? Clearly  this  requires  u^  ^ Z since  LQ 
must  be  1-1  on  Z.  Thus  the  dimension  of  Z < n-1  . If  ( 3.  12)  is  to  be 
onto  IRn  the  dimension  of  Z clearly  must  be  (n-1)  . It  follows  that 
N(LQ)  (the  null  space  of  LQ  ) must  be  span{uQ}.  Thus  0 is  a geo- 
metrically simple  eigenvalue  of  Lq  (i.  e.  dim  N(L  ) = 1)  . When  this 
holds  (3.12)  is  onto  for  any  choice  of  Z satisfying  uQ  <j  Z and 
dim  Z = n-1  provided  <j  R(LQ)  (the  range  of  LQ  ).  It  is  easy  to 

construct  examples  in  which  these  conditions  hold  and  the  algebraic 

00  . 

multiplicity  of  u^,  (i.  e.  dim  U N(LQ) ) is  greater  than  one.  It  is  useful 
to  define  0 to  be  an  Lj- simple  eigenvalue  of  LQ  when  the  above  con- 
ditions (N(LQ)  = span{u0),  uQ  <J  R(L))  hold.  If  I is  the  identity, 
then  the  I- simple  eigenvalues  are  precisely  the  algebraically  simple 
eigenvalues  (checking  this  is  an  easy  exercise).  We  further  define  x 
to  be  an  Lj-simple  eigenvalue  of  LQ  if  0 is  an  Lj-simple  eigenvalue  of 
Lg  - x Lj  . (At  this  point  we  are  thinking  of  LQ,  Lj  as  arbitrary  nxn 
matrices. ) 

Theorem  2.  Let  L^,  Lj  ( B(IR  ; IR  ) and  « IR  be  an  Lj-simple  eigen- 
value of  Lq  . Then  every  L * B(JR  > ]R n)  near  has  a unique  Lj- 
simple  eigenvalue  «(L)  near  kq  . If  N(LQ)  = span  (uQ },  dim  Z = n-1  and 
u0  <i  Z,  the  corresponding  eigenvector  can  be  uniquely  written  in  the 
form  uQ  + z(L)  where  z(L)  « Z.  L — k(L)  and  L-  z(L)  are  analytic 
functions. 

Sketch  of  proof;  Define 

3:  B(]Rn;  TRn)  X IR  X Z — lRn 

by  J(L,  x,  z)  = L(uQ  + z)  - kL1(uq  + z)  . Then  3(LQ,  kq,  0)  = LQuo  - = 0 


\ 
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and  the  derivative  of  (k,  z)  -*■  g(L,  k,  z)  at  (Lq,k0,  0)  is 
(k,  z)  - (Lq  - KqL^z  + k L1uq  . 

This  is  nonsingular  by  assumption,  so  g = 0 can  be  solved  by  the  im- 
plicit function  theorem  for  k(L),  z(L)  near  (kq,  0)  when  L is  near  Lq  . 

One  next  checks  the  uniqueness  and  simplicity.  (See  [4].) 

Theorem  3.  Let  F:  ]R  X IRn  - ]Rn,  F( 0)  s 0,  XQ  « IR,  LQ  = F^( XQ,  0)  , 

Lj  = F^u( XQ,  0)  and  0 be  an  I-simple  and  an  L^-simple  eigenvalue  of  i 

LQ  . Let  N(LQ)  = span{uQ } and  ]R  = ZXspan{uQ}.  Let  (X,  u)  = (X(s),  1 

s(uQ  + ip( s))  be  the  curve  of  solutions  of  F = 0 constructed  on  Theorem  1.  ‘ 

Let  k(s)  be  the  unique  I-simple  eigenvalue  of  Fu(X(s),  s(uQ  + i|j( s>>) 
near  0 for  s near  0 constructed  in  Theorem  2.  Then  there  is  a con- 
tinuous real-valued  function  A of  s such  that 

(3. 14)  k(s)  = sX'(s)  A(s)  . 
for  s near  0 and  A(0)  is  determined  by 

(3.15)  A(0)  uQ  + LjUo  £ R(LQ)  . 

Before  proving  the  result,  let  us  consider  its  utility  as  regards  the 
stability  question.  If  the  eigenvalues  of  Fu<X,  0)  have  positive  real 
parts  for  X < XQ  (so  the  trivial  solution  is  stable)  and  the  only  purely 
imaginary  eigenvalue  for  X = XQ  is  0,  then  F^( X( s) , u(s))  has  eigen- 
values with  positive  real  part  except,  perhaps,  one  corresponding  to 

the  perturbed  zero  eigenvalue  of  ^(Xq,  0)  = ^u^  ‘ T^is  *s  ‘ i 

In  this  situation  then,  k(s)  > 0 implies  su  + Si)j(s)  is  a stable  solution 

0 i 

of  (1.  7)  with  X = X(s)  . If  k(s)  < 0,  then  this  solution  is  unstable. 

Theorem  2 explains  how  to  compute  k(s)  (by  the  implicit  function  theorem). 

However,  this  computation  requires  detailed  knowledge  of  X(s),  4>(s)  • 

Theorem  3 says,  in  fact,  that  for  small  s it  is  enough  to  know  the  signs 
of  sX'(s)  and  A(0)  . No  additional  computations  are  necessary.  Re- 
calling that  the  Iterative  approximations  fx^(s)}  to  X(s)  provided  by 
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the  implicit  function  theorem  have  the  property  that  {\^(s)}  converges 
to  X (s)  (see  Section  2),  this  is  quite  pleasant.  If  we  are  in  the  anal- 
ytic case,  (3.14)  implies  that  the  leading  terms  in  the  series  of  «(s)  and 
sX'(s)  A (0)  are  equal.  Finally,  let  y(X)  be  defined  by  Y(^q)  = 0 and 

<3-16)  FU(M)  (uQ  + '(M)  = Y(M  (uq  + w(X)) 

where  w(X)  < 2,  w(0)  = 0 . Differentiating  (3.16)  at  X = XQ  yields 

(3*17)  Low'(xo)  + Liuo  = ^,(xo)uo  • 

Thus  A(0)  = -y’(Xo)  . In  the  situation  we  just  described,  y(\)  > 0 for 
X < XQ  and  y(X)  < 0 for  X > XQ  . Hence  y(XQ)  < 0 . (Note  y'(Xq)  = 0 
is  Impossible  since  L^uq  <j  R(LQ). ) Combining  these  observations  with 
(3.  14),  we  have  k(s)  > 0 is  sX'(s)  > 0 and  «(s)  < 0 if  sX'(s)  < 0 . 

This  is  a precise  version  of  the  statement  "supercritical  bifurcations  are 
stable,  subcritical  bifurcations  are  unstable". 

Proof  of  Theorem  3.  Let  Fu(s)  = Fu(X(s),  suQ  + s^s)),  etc. , and 
Fu(s)(uo  + z(s))  = k(s)  (u0  + z(s))  (z(s)  = z(Fu(s))  where  z(L)  is  the 
function  of  Theorem  2. ) We  have 

r . 

U)  Fu(sHuo  + <s+(s)n  + (sX'(s))  s F (s)  = 0 
( 3. 18)  ( and 

(ii)  fu(s)(uq  + z(s))  - k(s)(uq  + z(s))  = 0 , 


where  (3.18)  (i)  follows  from  differentiating  F(X(s),  suQ  + sip( s))  = 0 . 
Try  the  ansatz 


(3.19) 


r k(s)  = sX'(s)  A(s) 

z( s)  = (s4j(s))'  + sX'(s)  B(s) 


where  A has  values  in  IR  and  B in  Z . Subtracting  (3. 18)  (i)  from 
(3.18)  (ii)  and  using  (3.19)  we  find  that  if  3 = 0 where  3:  IR  xIRxZ-*-]Rn 
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is  given  by 

(3.  20)  3(s,  A,  B)  = F (s)B  + (s_1F  (s)  + A(u  + (s^s))'  + sV(s)B) 

U A.  U 

then  k and  z as  given  by  (3.19)  solve  (3.18)  (ii)  . Now 
(3.21)  3(0,  A,  B)  = LqB  - (L1uq  + Auq)  . 

Choosing  AQ,  BQ  in  IR  and  Z respectively  so  that  LqBq  - Aquq  = LjUq 
we  have  3(0,  AQ,  BQ)  = 0 . Moreover  the  derivative  of  3 with  respect 
to  (A,  B)  at  (0,A0,BQ)  is 

A.  A A A 

(A,  B -*  LQ  B + AuQ 

which  is  nonsingular.  Everything  follows  now  by  the  implicit  function 
theorem. 

We  want  to  emphasize  the  following  distinction.  Assume  that 
N(Lq)  = span{uQ}  is  one  dimensional,  LQ  = anc*  Lj  = F^u^o’ 

In  order  to  construct  bifurcating  curves  as  in  Theorem  1 we  need 
LjUq  <1  R(LQ)  • orc*er  to  use  Theorem  3 for  a stability  criterion  we  need 
uQ  J R(L0)  . These  are  different  conditions.  One  easily  sees,  from 
(3.16),  (3.17)  that  if_  (3.16)  has  a smooth  solution  y(\),  w(\)  then 
■y1  ( \q)  * 0 is  equivalent  to  LjUq  <j  R(LQ)  or  uQ  <J  R(LQ)  • However,  it 
may  happen  (see  [j])that  L^uq  <j  R(LQ)  and  (3.16)  has  no  smooth  solu- 
tions and  likewise  (3.18)  (ii)  has  no  smooth  solutions.  In  this  case, 
stability  cannot  be  studied  by  assuming  (3.18)  (ii)  has  solutions  k(s), 
z(s)  given  by  power  series.  (As  has  occurred.) 

For  further  reading  in  this  direction  (extension  to  Hopf  bifurcation, 
Infinite  dimensions,  etc.  ) and  relevant  historical  remarks  see  [4],  f 5] and 
the  review  article  [6].  The  information  that  the  sign  and  zeros  of  k(s) 
could  locally  be  read  off  from  the  same  information  about  sX.'(s)  was 
first  proved  in  [4].  The  more  elegant  "factored  form"  of  Theorem  3 was 
remarked  in  [11]. 

Liapunov  - Schmidt  FTocedure 


Let  G:  X — V where  X and  V are  Banach  spaces.  (X  corresponds 


I 
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to  A XU  above.)  Assume  G(xq)  = 0 . We  want  to  study  G = 0 near 

u . Assume  X and  V can  be  written  in  the  form 
0 


(3.22) 


(i)  X = N(G,(xq))®  Z 


(ii)  V = R(G'(x0))  ®W 


where  Z and  W are  closed  subspaces  of  X and  V respectively.  This 
assumes  that  R(G'(xq))  is  closed.  The  decomposition  (3.  22)  is  always 
possible  if  N(G'(xq))  and  V/R(G’(xQ))  are  finite  dimensional  or  if 
R(G'(xq))  is  closed  and  X,  V are  Hilbert  spaces.  Given  x e X we  write 
x = x^  + xz  + xQ  where  xN  e N(G'(xQ))  and  xz  « Z . Let  P project 
V on  R(G'(xq))  along  W . The  problem  G(u)  = 0 can  be  rewritten  as 


[ <‘>  H(xn’  xz}  3 P(G(xn  + xz  + xo))  = 0 

(3.23)  / 

I (ii)  (I-P)  G(xn  + xz)  = 0 . 

Now  H(0,  0)  = 0 and  the  derivative  of  Z 9 xz  - H(xN>  xz)  e R(G‘(xo))  at 
x^  =0,  xz  = 0 is  xz  -*  PG'(x0)  xz  which  is  one-to-one  and  onto.  Thus 
the  implicit  function  theorem  says  that  (3.  2 3)  (i)  can  be  solved  for  xz  = 
xz(xN)  and  G = 0 is  locally  equivalent  to  (I  - P)  G(XN  + X2^xn^  = 0 > 
an  equation  "in  N(G'(xq))"- 

Example;  X=lRxU,  V=U,  T:U-*U  compact  linear,  p:  R XV-»U 
satisfies  (3.2)  at  \Q  « IR  and  G(\,  u)  = u - \Tu  + P(\,  u)  . Then 
G'(\  , 0)  (X  , G)  = (I  - ^qT)u  has  the  null  space  {(X,  u);  u « N(I  - \QT) } of 
dimension  n + 1 where  n = dim  N(I  - X^T)  . R(I  - X^T)  is  also  of  co- 
dimension n (Riesz-Schauder  theory).  The  above  procedure  then  leads 
to  "n  + 1 equations  in  n unknowns".  These  are  (if  n > 1)  usually 
attacked  by  formal  expansion  methods.  In  the  analytic  case  one  is  con- 
fronted with  problems  of  real  analytic  geometry.  See  [24]. 

In  situations  where  the  construction  of  Theorem  1 applies  the 
Liapunov -Schmidt  procedure  can  be  used  to  first  reduce  the  "dimension- 
ality". This  does  not  seem  advantageous,  since  it  requires  more 


I 
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iterating  (once  for  Liapunov- Schmidt  and  then  once  more)  than  our  origin- 
al presentation. 


REFERENCES 

[1]  Cartan,  H. , Differential  Calculus,  Hermann,  Paris;  Houghton 
Mifflin,  Boston,  1971. 

[2]  Chow,  S.  N. , J.  K.  Hale  and  J.  Mallet-Paret,  Applications  of 
generic  bifurcation,  I and  II,  Arch.  Rational  Mech.  Anal.  59(197  5), 
159-188  and  62(1976),  209-235. 

[3]  Crandall,  M.  G.  and  P.  H.  Rabinowttz,  Bifurcation  from  simple 
eigenvalues,  J.  Functional  Anal. , 8 (1971),  321-340. 

[4]  , Bifurcation,  perturbation 

of  simple  eigenvalues  and  linearized  stability,  Arch.  Rational 
Mech.  Anal.,  52(1973),  161-180. 

[5]  , The  Hopf  Bifurcation 

Theorem,  TSR  #1604,  Mathematics  Research  Center,  University  of 
Wisconsin,  Madison,  Wisconsin,  197  6 (A  revised  extract  from  this 
report  is  also  to  appear  in  Archive  Rat.  Mech.  Anal.). 

[8]  , The  principle  of  exchange 

of  stability,  Proceedings  of  the  International  Symposium  on 
Dynamical  Systems,  Gainsville,  Florida,  1976  (to  appear). 

[7]  Fife,  P. , Branching  phenomena  in  fluid  dynamics  and  chemical 
reaction-diffusion  theory,  p.  23-83  in  Eigenvalues  of  Non-linear 
Problems,  Edizioni  Cremonese,  Rome,  1974. 

[8]  Hildebrandt,  T.  H.  and  L.  M.  Graves,  Implicit  functions  and 
their  differentials  in  general  analysis,  Trans.  Amer.  Math.  Soc. , 
29(1923),  127-153. 

[9]  . Hale,  J,  K. , Ordinary  Differential  Equations,  Wiley-Interscience, 

New  York,  1969. 

[10]  Hille,  E.  and  R.  S.  Phillips,  Functional  Analysis  and  Semi-Groups, 
Amer.  Math.  Soc.  Colloquium  Publications,  31,  Providence  1957. 


M.  G.  CRANDALL 


Joseph,  D.  D.  and  D.  A.  Nleld,  Stability  of  bifurcating  time 
periodic  and  steady  solutions  of  arbitrary  amplitude,  Arch. 
Rational  Mech.  Anal.  58(1975),  369-380. 

Keller,  J.  B.  and  S.  Antman  (eds. ),  Bifurcation  Theory  and  Non- 
linear Eigenvalue  Problems,  Benjamin,  New  York,  1969. 
Kirchgassner,  K.  and  H.  Kielhofer,  Stability  and  bifurcation  in 
fluid  dynamics,  Rocky  Mountain  Math.  J.  3(1973),  275-318. 
Marsden,  J.  E.  and  M.  McCracken,  The  Hopf  Bifurcation  and  its 
Applications,  Applied  Mathematical  Sciences  19,  Springer  Verlag, 
New  York,  1976. 

Nijenhuis,  A. , Strong  derivatives  and  inverse  mappings,  Amer. 
Math.  Monthly  81(1974),  969-980. 

Nirenberg,  L. , Topics  in  Nonlinear  Functional  Analysis,  Courant 
Institute  Lecture  Notes,  Courant  Institute  of  Mathematical 
Sciences,  1974. 

Rablnowitz,  P.  , A survey  of  bifurcation  theory,  in  Dynamical 
Systems:  An  International  Symposium,  Volume  1,  L.  Cesari, 

J.  Hale,  J.  La  Salle  eds. , Academic  Press,  New  York,  1976, 
p.  83-97. 

, Some  aspects  of  nonlinear  eigenvalue  problems, 

Rocky  Mountain  Math.  J.  3(1973),  161-202. 

, Variational  methods  for  nonlinear  eigenvalue  prob- 
lems, p.  141-19  5 in  Eigenvalues  of  Non-linear  Problems,  Edizioni 
Cremonese,  Rome,  1974. 

Sattinger,  D.  H.,  Topics  in  Stability  and  Bifurcation  Theory, 
Lecture  Notes  in  Mathematics  309,  Springer-Verlag,  New  York, 

197  3. 

Schwartz,  J.  T. , Nonlinear  Functional  Analysis,  Gordon  and 
Breach,  New  York,  1969. 

Stakgold,  I. , Branching  of  solutions  of  nonlinear  equations, 

SIAM  Rev.  13(1971),  289-332. 


BIFURCATION  AND  THE  IMPLICIT  FUNCTION  THEOREM 


35 


I 

[23]  Vainberg,  M.  M. , Variational  Methods  for  the  Study  of  Nonlinear 
Operators,  Holden-Day,  San  Francisco,  1964. 

[24]  Vainberg,  M.  M.  and  V.  A.  Trenogin,  Theory  of  Branching  of 

! Solutions  of  Nonlinear  Equations,  Monographs  and  Textbooks  on 

, Pure  and  Applied  Mathematics,  Nordhoff  International  Publishing, 

Leyden,  1974. 


Imperfect  Bifurcation 

Edward  L.  Reiss 

Abstract 

Bifurcation,  or  branching  of  solutions,  is  a prominent 
feature  of  many  nonlinear  stability  problems.  In  elastic 
stability,  bifurcation  is  called  buckling,  and  in  hydro- 
dynamic  stability  bifurcations  are  called  transitions.  In 
many  applications,  the  bifurcations  are  perturbed  by  imper- 
fections, impurities,  and  other  inhomogeneities  in  the 
physical  system.  These  perturbations  are  frequently 
singular.  A new  method  for  analyzing  singular  perturbations 
of  bifurcations  is  described.  The  method  of  matched  asymp- 
totic expansions  is  employed  in  the  analysis.  Applications 
of  the  method  to  the  buckling  of  crooked  columns,  and  to 
forced  nonlinear  oscillations  are  presented. 

1 . Introduction . 

The  buckling  of  an  elastic  column  by  an  axially  com- 
pressive thrust  was  one  of  the  first  bifurcation  problems  to 
be  studied.  The  theory,  which  is  called  the  Elastica,  was 
derived  and  analyzed  by  Euler.  In  this  theory,  it  is  as- 
sumed that  the  axis  of  the  column  is  a straight  line  before 
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the  application  of  the  thrust.  The  axis  remains  straight 
until  the  thrust  reaches  a critical  value  that  is  called  the 
Euler  buckling  load.  The  column  bends  into  a buckled  equi- 
librium state  when  the  thrust  exceeds  this  critical  value. 

We  ass'jne  that  the  x axis  coincides  with  the  initial 
axis  of  the  column.  It  is  normalized  by  the  length  of  the 
column,  so  that  0 <_  x <_  1.  The  horizontal  and  vertical  dis- 
placements of  the  buckled  axis  are  denoted  by  u(x)  and  w(x) 
respectively.  Furthermore,  <f>(x)  is  the  angle  between  the 
tangent  to  the  column's  axis  and  the  x axis.  Then  the 
Elastica  boundary  value  problem  is  given  by: 


(1.1)  4>"  + X sin  <)>  = 0 , 


(0)  = (1)  = 0 ; 


(1.2) 


u'  = cos  4>  - 1 , 
w'  = sin  4>  , 


u (0)  = 0 , 
w (0)  = w(l)  = 0 . 


A prime  denotes  differentiation,  and  ) > 0 is  a parameter 
proportional  to  the  thrust.  The  boundary  conditions  in 

(1.1)  and  (1.2)  imply  that  the  ends  of  the  column  are  pinned 
(simply  supported) . 

The  solutions  of  (1.1)  and  (1.2)  have  been  obtained  ex- 
plicitly in  terms  of  elliptic  functions  [1-3].  The  results 
are  summarized  in  the  "bifurcation  diagram"  given  in  Figure 
1.1.  We  observe  that  4>  = u = w=  0isa  solution  of  (1.1)  and 

(1.2)  for  all  values  of  X.  This  is  the  straight  state  of 
the  column.  Buckled  states  branch  from  the  straight  state 
at  the  buckling  loads 

2 

X = Xn  = (nil)  , n = 1,2,...  . 


(1.3) 
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The  minimum  buckling  load  X1  is  the  Euler  buckling  load. 

The  buckled  state  branching  from  X exists  for  all  X > X . 

n — n 

Two  of  these  states  are  sketched  in  Figure  1.1.  In  more 
general  discussions  of  bifurcation,  the  straight  state,  the 
buckled  states,  and  the  buckling  loads  (1.3)  are  called,  the 
basic  solution,  the  bifurcation  branches,  and  the  bifurca- 
tion points  respectively.  We  shall  adhere  to  this 
terminology. 

The  solutions  of  the  Elastica  demonstrate  one  of  the 
main  features  of  bifurcation. .. the  sharp  transition  in  solu- 
tion multiplicity  at  the  bifurcation  points.  However, 
buckling  experiments  show  that  bifurcation  does  not  occur 
for  real  columns.  The  experimental  results  are  qualita- 
tively similar  to  the  heavy  curves  in  Figure  1.2.  The  sharp 
transition  of  bifurcation  is  replaced  by  the  smooth  transi- 
tion of  reality. 

This  discrepancy  between  experiments  and  theory  is  at- 
tributed to  small  imperfections  that  are  always  present  in 
experimental  specimens.  The  small  deviations  of  the  axis  of 
the  experimental  column  from  the  assumed  straight  line  of 
the  Elastica  is  a dominant  imperfection.  Experimental  re- 
sults show  that,  in  general,  the  smaller  the  imperfections 
the  closer  the  agreement  between  experiments  and  theory. 

Thus  the  column  corresponding  to  experimental  curve  2 in 
Figure  1.2  has  less  imperfections  than  column  1. 

Similar  phenomena  occur  in  the  buckling  of  elastic 
plates  and  shells.  The  classical  theory  for  the  buckling  of 
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axially  compressed,  cylindrical  elastic  shells  [4,5]  gives  a 
bifurcation  problem.  For  many  ranges  of  parameters,  the  bi- 
furcation from  the  minimum  bifurcation  point  X1  is  as  shown 
in  Figure  1.3.  In  this  theory,  the  midsurface  of  the  shell 
is  assumed  to  be  a perfect,  right  circular  cylinder.  One  of 
the  principal  imperfections  in  shells  is  the  waviness  of  the 
initial  midsurface  of  experimental  specimens.  Representa- 
tive experimental  results  are  sketched  in  Figure  1.3.  A 
maximum  thrust  Xg  is  reached  at  which  the  shell  jumps  to  a 
large  amplitude  equilibrium  state.  The  dashed  segment  of 
the  curve  in  the  figure  is  conjectured,  because  it  is  not  ob- 
served in  experiments.  The  value  of  Xg  is  the  experimental 
buckling  load.  It  may  be  1/2  or  as  small  as  1/5  of  X^.  The 
difference  X^  - X£  is  decreased  for  experiments  on  carefully 
manufactured  specimens.  Thus  imperfections  have  a more 
dramatic  effect  in  shell  buckling  than  column  buckling. 

For  column  buckling,  as  shown  in  Figure  1.1,  the  bifur- 
cation branches  exist  near  the  bifurcation  points  only  to 
the  right  of  the  bifurcation  points.  This  is  called  super- 
critical bifurcation.  Subcritical  bifurcation  occurs  if  the 
bifurcation  solutions  branch  only  to  the  left  near  the  bi- 
furcation points;  see  Figure  5.1.  For  the  bifurcation  shown 
in  Figure  1.3,  there  is  a bifurcation  branch  for  values  of  X 
above  and  below  X^.  This  is  called  transcritical  bifur- 
cation. * 


•This  term  is  due  to  T.  B.  Benjamin. 
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Imperfections  are  significant  in  other  nonlinear  sta- 
bility problems,  such  as  the  thermal  convection  of  a viscous 
fluid  between  two  parallel  plates.  The  bottom  plate  is  uni- 
formly heated  and  the  top  plate  is  uniformly  cooled.  The 
Rayleigh  number  \ is  the  basic  parameter.  It  is  propor- 
tional to  the  temperature  difference  between  the  top  and 
bottom  plates.  The  Boussinesq  theory  [6]  is  used  to  de- 
scribe the  convection  mathematically.  It  yields  a bifurca- 
tion problem.  The  basic  solution  is  the  conduction  state, 
in  which  the  heat  is  transmitted  from  the  bottom  to  the  top 
plate  by  conduction  only  (no  fluid  motion) . Steady  convec- 
tion solutions  bifurcate  supercritically  from  the  conduction 
state  at  an  infinite  sequence  of  bifurcation  points.  The 
lowest  bifurcation  point  is  called  the  critical  Rayleigh 
number.  However,  careful  experiments,  see  e.g.  [7] , show 
that  bifurcation  does  not  occur.  Instead,  there  is  a smooth 
transition  similar  to  the  buckling  of  imperfect  elastic 
columns,  as  shown  in  Figure  1.2.  Experimental  and  theoreti- 
cal attempts,  see  e.g.  [7,8],  to  ascribe  this  discrepancy  to 
"non-Boussinesq"  effects,  such  as  variation  of  the  fluid 
viscosity  with  temperature,  have  not  been  successful.  Small 
imperfections  such  as,  thermal  noise  in  the  plates  [9] , or 
lateral  variations  in  the  distance  between  the  plates,  are 
sufficient  to  qualitatively  explain  the  deviation,  as  we 
shall  show  in  a future  publication. 

To  analyze  mathematically  the  perturbations  of  bifurca- 
tions caused  by  imperfections,  inhomogeneities,  and  other 
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impurities,  we  modify  the  classical  bifurcation  theory  by 
introducing  a small  parameter  6 which  characterizes  the  mag- 
nitudes of  these  imperfections.  Thus  we  consider  the 
general  nonlinear  problem 

(1.4)  F[y;X,6]  = 0 , 

where  y = y(X,6)  is  the  solution  vector.  The  corresponding 
bifurcation  problem  is 

(1.5)  F[y;X,0]  = 0 

with  solutions  y(X)  = y(X,0).  We  refer  to  (1.4)  as  the  per- 
turbed problem,  and  to  the  solutions  of  (1.4)  and  (1.5)  as 
the  perturbed  and  bifurcation  branches,  respectively.  We 
assume  that  as  <5  -*•  0,  the  perturbed  branches  y(X,6)  the 
bifurcation  branches  y(X),  and  the  perturbed  problem  (1.4)  -*■ 
the  bifurcation  problem  (1.5). 

The  principal  methods  for  analyzing  perturbed  problems 
are  due  to  Koiter  [10],  Keener  and  H.  Keller  [11],  J.  Keller 
[12],  Chow,  Hale  and  Mallet-Paret  [13],  and  Matkowsky  and 
Reiss  [14].  The  methods  in  [10-13]  are  local  analyses  near 
bifurcation  points.  Koiter' s method  employs  a variational 
principle  and  uses  the  Ritz  method.  It  is  a formal  proce- 
dure that  is  widely  used  by  structural  engineers.  The  re- 
view [15]  contains  references  to  applications  of  the  method. 
Keener  and  H.  Keller's  [11]  method  is  a rigorous  iteration 
procedure.  It  is  applicable  apparently  only  to  perturbed 
branches  that  have  a "nose",  such  as  occurs  in  shell  buck- 
ling (Figure  1.3),  and  not  to  perturbed  branches  as  shown  in 
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Figure  1.2.  The  nonlinear  forced  oscillation  of  an  acoustic 
waveguide  was  treated  by  J.  Keller  [12]  as  a perturbed  prob- 
lem by  a formal  perturbation  expansion  near  the  bifurcation 
points  of  periodic  solutions  of  the  free  oscillation  pro- 
blem. A modification  of  the  Lyapunov-Schmidt  method  of 
classical  bifurcation  theory  [16]  , and  a theorem  of  Thom 
[17]  on  the  generic  forms  at  the  singularities  of  nonlinear 
mappings,  were  used  by  Chow,  Hale  and  Mallet-Paret  [13]  to 
give  a rigorous  analysis  of  the  perturbed  problem  near  bi- 
furcation points.  The  analysis  of  Matkowsky  and  Reiss  [14] 
will  be  briefly  described  in  Section  2.  It  employs  the 
method  of  matched  asymptotic  expansions  [18-20]  to  obtain 
formal  asymptotic  expansions  of  the  perturbed  branches  as 
6*0  near  bifurcation  branches  which  are  known  either 
exactly  or  approximately.  Thus  when  bifurcation  branches 
are  known  globally,  for  example  by  numerical  computation, 
the  method  [14]  gives  global  representations  of  the  per- 
turbed branches . Applications  of  the  method  to  the  buckling 
of  crooked  columns,  and  to  the  forced  response  of  a non- 
linear oscillator  are  given  in  Sections  3-5.  Additional 
applications  to,  the  buckling  of  imperfect  plates,  thermal 
convection,  the  robustness  of  population  dynamics  models, 
panel  flutter,  and  other  problems,  will  be  presented  in 
future  publications.  Generalizations  of  the  method  to 
multiple  bifurcation  points,  and  to  imperfections  and 
secondary  bifurcation  are  presently  being  investigated. 
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2 . Singular  Perturbations  of  Bifurcations. 

To  motivate  the  method  of  Matkowsky  and  Reiss  f 1 4 ] , we 
consider  the  algebraic  equation 

(2.1)  F [y; X , 6]  = y2  + (1  -X)y  - X6  = 0 . 

The  solutions  of  the  corresponding  bifurcation  problem 
(6  = 0)  are,  y ( A ) = 0,  and  y ( A ) = X - 1.  They  are  sketched 
in  Figure  2.1  with  the  solutions  of  (2.1)  for  representative 
positive  and  negative  values  of  6.  Thus  X = 1,  y = 0 is  the 
only  bifurcation  point  of  the  bifurcation  branches. 

As  6 + 0,  the  perturbed  branches  approach  the  bifurca- 
tion branches.  The  perturbed  branches  y(X,<S)  are  smooth 
functions  of  X for  each  6,  but  the  bifurcation  branches  y ( X ) 
have  a "corner"  at  the  bifurcation  point.  Thus  y(X,6)  + y(X) 
as  <5  -*■  0 non-uniformly  in  X,  in  any  interval  containing 
= 1.  Consequently,  (2.1)  is  a singular  perturbation  of 
th^j  corresponding  bifurcation  problem.  This  immediately 
suggests  using  the  method  of  matched  asymptotic  expansions 
to  obtain  uniformly  valid  asymptotic  expansions  of  the  per- 
turbed branches  as  { + 0. 

We  now  briefly  describe  this  method  for  the  general 
nonlinear  equation 

(2.2)  F [y ; X , 6 ] = 0 . 

The  corresponding  bifurcation  problem  is 

(2.3)  F [y ; X , 0 ] = 0 . 

We  assume  that  the  solutions  of  (2.2)  and  (2.3)  are 
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elements  of  a real  Hilbert  space,  and  that  F is  a nonlinear 
operator  on  that  space.  The  parameters  A and  6 are  assumed 
to  be  real,  and  the  bracket  notation  <•,*>  is  used  for  the 
inner  product. 

We  first  define  the  singular  points  of  any  bifurcation 
branch  y(A)  as  the  values  A = Aq  and  the  corresponding 
values  y (Aq)  for  which  the  linearized  problem 

(2.4)  Fy[y(A)  ;A,0]<|>  = 0 , <♦,♦>  = 1 , 

has  solutions.  The  linear  operator  F [y(A);A,0]  in  (2.4), 
is  the  Frechet  derivative  of  F[y;A,0].  The  significance  of 
the  singular  points,  in  the  present  context,  is  that  they 
are  the  points  at  which  non-uniform  convergence  is  possible. 
All  bifurcation  points  are  singular  points.  There  are  sin- 
gular points,  such  as  the  limit  points  shown  in  Figure  2.2, 
that  are  not  bifurcation  points.  With  reference  to  Figure 
2.2,  we  define  the  sub-branches  of  a bifurcation  branch  y(A) 
as  the  segments  connecting  two  successive  singular  points  of 
y(A).  Thus  the  bifurcation  branch  shown  in  Figure  2.2  has 
three  sub-branches. 

The  Outer  Expansions.  Let  Yq(A)  denote  a sub-branch, 
and  Xq  a singular  point  of  y^.  We  seek  asymptotic  expan- 
sions of  the  perturbed  branches  near  yQ  in  the  form 

oo 

(2.5)  y(A,6)  = ^ y. (A)6j  . 

By  substituting  (2.5)  into  (2.2)  and  equating  to  zero  the 
coefficient  of  each  power  of  6,  we  obtain  the  recursive 
system  of  linear  equations. 
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F°(X)Y;l(X)  = -F°(A)  = RX(X)  , 

12.6) 

Fy(A)y_.  (X)  = R.  (A)  , j = 2,3,...  . 

The  superscript  zero  denotes  the  operator  evaluated  at  6=0, 
e.g.  F°(A)  = Fy[y0  (A)  ,A;0]  . 

We  observe  that  Fy(A)  is  invertible  except  at  a sin- 
gular point  A = Aq.  In  [14],  we  studied  (2.6)  under  the  as- 
sumption that  there  is  an  element  ip*  in  the  null-space  of 
the  operator  that  is  adjoint  to  Fy(AQ),  such  that, 

(2.7)  I = <F°(Aq)  ,<p*>  / 0 . 

Then  it  follows  from  (2.6)  that  y^A),  and  hence  the  expan- 
sion (2.5),  is  not  defined  at  Aq.  In  fact,  |y(A,6)|  «°  as 

A -*■  Aq,  for  many  operators  F.  Thus  we  refer  to  (2.5)  as  the 
outer  expansion  corresponding  to  Yq ( A ) . 

Perturbations  caused  by  imperfections  that  satisfy 
(2.7)  "destroy"  bifurcation,  as  we  shall  show.  If  I = 0, 
then  the  imperfections  need  not  destroy  bifurcation.  This 
is  demonstrated  in  Section  4,  where  it  is  shown  for  a spe- 
cial problem,  that  the  imperfections  distort  the  bifurcation 
branches  without  destroying  bifurcation.  In  this  section  we 
shall  retain  assumption  (2.7).  A similar  analysis  is  re- 
quired if  (2.7)  is  violated. 

The  Inner  Expansions.  We  obtain  expansions  of  the  per- 
turbed branches  that  are  valid  near  A as  { + 0 by  first  in- 

o 

troducing  the  "stretching"  transformation 

CD 

A = Ao  + ?wa  + y]  ^(P3)1  , 


(2.8) 


f 


IMPERFECT  BIFURCATION  47 


where  y is  a new  small  parameter  that  is  defined  by 
(2.9)  |6(y) | = yb  . 


The  positive  constants,  a and  b,  are  to  be  determined.  Then 

we  obtain  asymptotic  expansions  of  y(A,6)  that  are  valid 

near  A in  the  form 
o 


(2.10) 


z (y)  = y (A  (y)  , 6 (y ) ) = 


:=c 


V 


where  Zq  = y(A  ).  The  coefficients  in  the  inner  expansion 
(2.10)  are  determined  in  the  usual  way  by  inserting  (2.10) 
into  (2.2).  This  leads  to  the  recursive  system  of  linear 
problems 


(2.11) 


F ( A ) z . = p . 

y o'  : 


j = 1,2, 


The  inhomogeneous  terms  p.  depend  on  a,  b,  z.,z9,...,z.,, 

3 -L  z 3“-L 

and  the  derivatives  of  A and  6 with  respect  to  y,  evaluated 
at  y = 0. 

Since  the  operator  F°(Ao)  is  singular,  the  p^  must 
satisfy  the  solvability  conditions, 


(2.12)  <P  j ,<!>*>  = 0 , j = 1,2,...  , 

where  <J> * is  in  the  null  space  of  the  adjoint  of  F°(Aq).  We 
have  assumed  that  the  null  space  is  one  dimensional  and 
spanned  by  <(>*.  Multidimensional  null  spaces  will  be  con- 
sidered in  a subsequent  publication.  An  analysis  of  (2.12) 
and  the  linear  problems  (2.11)  is  given  in  [14],  We  shall 
summarize  the  results  for  operators  F which  satisfy  the 
conditions 
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(2.13)  F°(A0)  = F°x(Ao)  0 . 

The  more  general  case  is  discussed  in  [14].  The  conditions 
(2.13)  are  satisfied  in  many  problems  at  the  bifurcation 
points,  but  not  at  limit  points. 

The  precise  form  of  the  inner  expansion  then  depends  on 
the  nonlinearity  at  the  singular  point.  If 


(2.14) 
then  | 6 | 

(2.15) 


Q = <(F°z(Ao) $)$,$*>  ? 0 , 
U2,  and 

z = zQ  + A(0<t>|6|1/2  + 0(6)  , 
A = Ao  + 5 | 6 | 1/2  + 0(6)  , 


where  the  amplitude  A (5)  is  a real  root  of  the  quadratic 
equation 

(2.16)  QA2  + 2?<F°x(Ao)<|>,4>*>A 

+ 2 (sgn  6)  <F°(Aq)  ,<()*>  = 0 , 

where  sgn  is  the  sign  function.  Thus  there  are  inner  expan- 
sions (2.15)  if  and  only  if  the  discriminant  of  (2.16)  is 
non-negative,  i.e. 

, 2 (sgn  6)Q<F°(A  ) ,$*> 

(2.17)  r > - 5 . 

<FZx*'**y 

Furthermore,  inner  expansions  occur  in  pairs,  and  they  co- 
alesce when  the  equality  holds  in  (2.17).  Condition  (2.14) 
characterizes  the  nonlinearity  near  Aq  as  quadratic. 

Higher  order  terms  in  the  expansions  can  be  obtained  by 
analyzing  (2.11)  and  (2.12).  The  amplitudes  Aj  of  4>  in  each 
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Zy  for  j > 1,  are  then  solutions  of  linear  inhomogeneous 
algebraic  equations.  The  solvability  conditions  for  these 
equations  determine  the  parameters  £2,£2,...  • 

If  Q = 0 and 

(2.18)  C = <{((F°  (A  U)<f>)4>  + 4 (F®  ( A ) <f>)  Z_ 

z z z o z z o z 

+ 2 (F°z(Ao)z2) $},$*>  ^ o, 
where  Z2  is  defined  by 

(2.19)  Z2  1 - T P°1Ao»"1[(F2Z»>*>  ■ 

then  the  nonlinearity  near  Aq  is  cubic.  Then  | 6 | = p3  and 
the  inner  expansions  are  given  by 


0 

(2.20) 

A = A 

o 

The  amplitudes  A(£) 
cubic  equation, 


+ A(U<H<5|1/3  + 0(62/3)  , 

+ 4 | <5 1 2/3  + 0(6)  . 

are  the  real  roots  of  the  canonical 


(2.21)  CA3  + 65<(F°x(Ao)<fr),.j>*>A 

+ 6 (sgn  6)<F°(Ao)  ,<(>*>  = 0 . 

Thus  there  may  be  one,  two,  or  three  inner  expansions  at  a 
singular  point  with  cubic  nonlinearity,  depending  on  the 
values  of  £. 


If  C = 0,  then  quartic  and  higher  order  nonlinearities 
near  Aq  must  be  analyzed  to  determine  the  inner  expansions. 
Thus,  a secondary  result  of  this  method  for  constructing 
asymptotic  expansions  of  the  perturbed  branches,  is  a formal 
classification  of  the  singularities  of  general  nonlinear 
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equations.  Thom  [17]  has  rigorously  classified  the  singu- 
larities of  a special  class  of  equations,  using  different 
methods . 

We  observe  from  (2.15)  and  (2.20)  that  A - Aq  is 
0(|6|^2)  for  quadratic  nonlinearities,  and  0(|6|2/^)  for 
cubic  nonlinearities.  Thus,  the  effects  of  imperfections 
are  more  severe  for  quadratic  than  cubic  nonlinearities. 
Since  quadratic  nonlinearities  usually  correspond  to  trens- 
critical  bifurcation,  this  conclusion  is  in  qualitative 
agreement  with  experimental  observations. 

Matching.  There  are  several  sub-branches  of  the  bifur- 
cation problem  emanating  from  each  singular  point.  For  each 
sub-branch  there  is  an  outer  expansion,  and  there  are 
several  inner  expansions  for  each  of  its  singular  points. 

To  obtain  uniform  asymptotic  expansions  of  the  perturbed 
branches,  it  is  necessary  to  appropriately  connect  the  outer 
and  inner  expansions.  These  connections  are  obtained 
systematically  by  the  matching  procedure  of  the  method  of 
matched  asymptotic  expansions  [18-20].  The  matching  will  be 
illustrated  for  the  specific  problem  in  Section  3.  A dis- 
cussion of  the  procedure  for  the  general  problem  (2.2)  is 
given  in  [14] . 

Finally,  we  wish  to  remark  that  the  inner  and  outer  ex- 
pansions are  completely  determined  by  the  analysis.  This  is 
in  contrast  to  the  usual  applications  of  the  method  of 
matched  asymptotic  expansions,  where  the  inner  and  outer  ex- 
pansions contain  undetermined  constants  or  functions.  The 
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matching  conditions  then  evaluate  these  undetermined 
quantities . 

Sketches  of  typical  inner  and  outer  expansions  at  the 
bifurcation  point  X^  are  given  in  Figure  3.1  for  a cubic 
nonlinearity  and  supercritical  bifurcation.  The  curves  de- 

I 

noted  by  B are  bifurcation  sub-branches.  The  heavy  curves 
denoted  by  0 are  the  outer  expansions.  The  dashed  curves 
show  the  inner  expansions.  The  composite  expansions  are 
sketched  in  Figure  3.2.  These  results  are  obtained  in 
Section  3. 

3 . Buckling  of  Crooked  Columns. 

We  now  generalize  the  Elastica  theory  to  account  for 
imperfections  in  the  column.  We  assume  that  the  initial  un- 
stressed axis  is  a specified  curve.  The  angle  between  the 
tangent  to  this  curve  and  the  horizontal  is  ^ (x) . Then,  as 
we  can  show,  the  boundary  value  problem  (1.1)  is  modified  to 

(3.1)  <|>"  + X sin  <J>  = 6g(x)  , * ( 0 ) = <*>'(1)  = 0 , 

where  6g(x)  = 4>j' (x)  and  4>j  (0)  = 4)^(1)  = 0.  We  shall  not 
present  the  boundary  value  problems  for  u and  w correspond- 
ing to  (1.2).  Thus  the  imperfection  appears  as  a specified 
"forcing  function"  in  the  differential  equation.  Here,  6 is 

the  magnitude  of  the  imperfections,  e.g.  6 = max  |4>"(x)|, 

0<x<l  1 

and  g(x)  is  the  imperfection  shape  function.  For  simplicity, 
we  shall  assume  6 > 0. 
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We  obtain  the  bifurcation  problem  (1.1)  by  setting 
6 = 0 in  (3.1).  It  can  be  solved  explicitly  in  terms  of 
elliptic  functions  [1-3] . We  shall  be  concerned  only  with 
the  solution  branching  from  the  lowest  bifurcation  point 
A = A^.  The  bifurcation  points  (1.3)  are  the  eigenvalues  of 
the  bifurcation  problem  linearized  about  4>  = 0.  The  nor- 
malized eigenfunctions  are  4>n(x)  = cos  nnx. 

There  are  four  bifurcation  sub-branches  emanating  from 
A^.  They  are  the  two  sub-branches  <)> Q = 0,  to  the  left  and 
right  of  A^,  and  the  two  supercritical  bifurcation  sub- 
branches = tV,  branching  above  and  below  A^J  see 
Figure  1.1. 

To  demonstrate  the  application  of  the  method  [14]  to 
problems  where  the  bifurcation  branches  cannot  be  obtained 
explicitly,  we  approximate  them  near  A by  the  standard 
perturbation  method;  see  e.g.  [21].  If  we  define  the 
amplitude  parameter  e by 


(3.2) 


then  we  obtain 


(3.3) 


A 


e2  = | >F2dx 
0 

2 

Ax[l  + V + 0(e3)]  ' 

+ 0 (e3)  . 


Let  <}>0(x;A)  denote  any  one  of  the  four  bifurcation  sub- 
branches. Then  the  outer  expansions  of  the  perturbed 
branches  are 


(3.4) 


<Mx;A,6) 


£ 


4> . (x ; A ) 6 


j 
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By  substituting  (3.4)  into  (3.1),  we  find,  for  example,  that 
is  determined  by 

(3.5)  + X cos^q)^  = g (x)  , 4>[(0)  = <J>^(1)  = 0 . 


We  solve  (3.5)  by  an  eigenfunction  expansion,  using  the  nor- 
malized eigenfunctions  and  eigenvalues  o^,  that  are  de- 
fined for  m = 1,2,...,  by 

(3.6)  v"  + A (cos  <f>n)v  = o v , v ’ ( 0 ) = v ' ( 1 ) = 0 . 

m 0 m m m m m 

<v„ , v > = 1 . 
m m 


Then  by  assuming  that  g(x)  has  the  representation 


(3.7)  g (x) 


g vm(x)  , 
m m 


g = <g , v > 


we  obtain  the  solution  of  (3.5)  as 

oo 


(3.8) 

that 

(3.9) 


m=l  m 


m 


For  the  two  sub-branches  <f>0  = 0,  we  obtain  from  (3.6) 


A - A 


and  v ♦m<x>  ' m = 1,2, 


m m 

Thus,  by  (3.8)  and  (3.9),  the  outer  expansions  (3.4)  for  the 


sub-branches  4>g  = 0 are 


(3.10) 


o 

9m 

A"  - X 


m 


m 


6 + 0(6^)  , 


where  the  coefficients  g°  are  given  by 

(3.11)  g°  = <g,4>m>  , m = 1,2,...  . 

Thus  if  g°  + 0,  which  is  equivalent  to  the  condition  (2.7), 
then  the  outer  expansions  (3.10)  are  unbounded  as  A ♦ A^. 
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In  particular,  if  g°  < 0,  then  (i  + » as  )i  ♦ H from  below. 
The  sign  of  <fi  is  switched  if  A -*■  A^  from  above.  Sketches  of 
these  outer  expansions  are  indicated  by  the  curves  Oq  in 
Figure  3.1. 

For  the  sub-branchas  <f>Q  = ±T  given  by  (3.3),  the  coef- 
ficients in  the  differential  equations  (3.6)  for  vm  are 
analytic  functions  of  e.  Therefore  we  determine  the  eigen- 
values and  eigenfunctions  in  the  form 


(3.12)  vm(x;e) 


v 

m,3 


(x)ej  , 


°m(e) 


o . e 
m,3 


j 


m — 1 , 2 , . . • . 

By  analyzing  the  resulting  system  of  linear  problems,  we 
obtain 


v = 6 + 0(e)  , 

m m 


(3.13) 


X1  2 
~T 


o,  = - e‘  + 0(e  ) , 


o = A.  - A + 0(e)  , 
m l m 


ni  — 1 f 2 f • • • t 


m = 2,3,... 


Substituting  these  results  in  (3.7)  and  (3.8),  and  then  into 


(3.4),  we  finally  obtain, 
(3.14)  <J,  = + 


r 2g?<t 


e m=2  A,  (1-m  ) 


^m  + 0(E) 


+ 0(6)  . 


The  two  outer  expansions  given  by  (3.14)  are  unbounded  as 
e -*•  0 (A  -*•  A^)  if  g°  f 0-  They  are  sketched  in  Figure  3.1 
for  g°  < 0,  and  they  are  labelled  as  O^.  The  bifurcation 
branch  is  the  light  curve. 


IMPERFECT  BIFURCATION 


55 


The  Inner  Expansions.  In  the  notation  of  Section  2, 

F = -X  sin  z,  and  F = -X  cos  z,  for  the  modified 
zz  zzz 

Elastica  (3.1).  Thus  it  follows  from  (2.14)  and  (2.18)  that 
Q = 0,  C / 0,  and  the  nonlinearity  is  cubic  at  the  bifurca- 
tion point  X = X ^ . Then,  the  inner  expansions  are  of  the 
form 


(3.15a) 

X = \1  + £w2  + 0 (p3)  , 

oo 

(3.15b) 

<f>(x;  X (y)  , 6 (p) ) = z(p)  = ^ \ 

zj  w3  » 

where  6 

= p3.  By  inserting  (3.15)  into 

(3.1),  we  get 

(3.16a) 

Lz^  = z£  + X^z^  = 0 , 

z'(0)  = z^(l)  = 

0 ; 

(3.16b) 

o 

II 

<N 

N 

A 

z2 (0)  = z'(l)  « 

0 ; 

(3.16c) 

L23  = ^1Z1  " 6CZ1  + 6{|  ' 

z’(0)  = z}(l)  ■= 

0 . 

Equations  (3.16a,b)  imply  that 

(3.17)  z1  = A4>x  , z2  = B<J>2  . 

From  the  solvability  condition  for  (3.16c),  the  ampli- 
tude A must  satisfy  the  cubic, 

3 4 ^1 

(3.18)  A — t — £A  + 4 t — = 0 • 

A1  A1 

This  equation  has  a real  root  A1 ( C ) for  all  £.  It  is  posi- 
tive (negative)  if  g^  is  negative  (positive) . Furthermore 
we  have 


I Ax  (£)  | -*•  00  , 

A1(5)  - 0 , 


as  £ 


00 


I 


(3.19) 


as  £ •* 


— oo 
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A^(£)  is  the  unique  root  of  (3.18)  if 


La?  1/3 


5 < 50  = 3 


If  £ exceeds  £0»  then  there  are  two  additional  roots  A2  (£) 
and  A 3 ( C ) - Their  sign  is  opposite  to  the  sign  of  A^ . In 


addition,  these  roots  satisfy 


(3.20) 


A2  (?)  -*•  0 , and  |A3(U  | - 

W " W ‘ 


as  ( + 


Thus,  for  each  real  root  of  (3.18)  we  have,  from  (3.15), 
(3.17),  and  6 = y3,  an  inner  expansion 


X = X1  + C.6  + 0(6)  , 


(3.21) 


(i)  = A.(a<^161/3  + 0(62/3)  , i = 1,2,3  . 


Matching  Conditions.  There  are  four  outer  expansions 
given  by  (3.10)  and  (3.14),  and  one,  two,  or  three  inner  ex- 
pansions given  by  (3.21)  corresponding  to  the  bifurcation 
point  X^.  The  matching  conditions  of  the  method  of  matched 
asymptotic  expansions  determine  how  these  expansions  are 
connected. 

The  matching  conditions  are  obtained  by  expressing  the 

outer  expansions,  given  generically  by  (3.4),  near  X^  in 

3 

terms  of  the  inner  variables  X(y),  and  6(y)  = y . This  gives 


(3.22) 


♦ (x;  X (y)  ,6  (y) ) 


$ j (x; X (y ) ) y' 


Then  (3.22)  is  expanded  in  a power  series  in  y as 


(3.23) 
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This  gives  a representation  of  the  outer  expansions  near  X^. 
For  the  outer  expansions  (3.10),  = (g ^/C)^,  and  for  the 

outer  expansions  (3.14),  we  have 

(3.24)  nx  = [>2(C/X1)1/2  - g°/(250<t'1  • 

We  have  used  (3.3)  to  obtain  (3.24). 

The  matching  conditions  are  derived  by  assuming  that 
there  is  a common  interval  in  X,  that  approaches  X^  as 
p -*■  0,  in  which  the  outer  expansions  (3.2  3)  and  the  inner 
expansions  (3.15b)  are  valid.  It  then  follows  that 

(3.25)  lim  (n.  - z,  ) = 0 , for  k = 1,2,...  . 

I?  I — 

Discussions  of  matching  are  given  in  [14,18,19],  and 
elsewhere . 

To  apply  (3.25)  we  require  the  asymptotic  expansions  as 
|C|  + ">  of  z^U).  Since  the  amplitude  A of  is  a root  of 
(3.18),  we  have,  using  (3.19)  and  (3.20),  that 


lim  A. 

= lim  A,  = 

2(e/v 

^-►oo 

£-*■00  J 

lim  A? 

= lim  A. 

= gj/c  • 

£+00  z 

£-♦•—00 

The  matching  condition  (3.25)  with  k = 1 then  shows  that  the 
inner  and  outer  expansions  match  as  indicated  in  Figure  3.1. 
The  inner  expansions  are  represented  by  dashed  lines. 

The  uniform  expansions  are  sketched  in  Figure  3.2.  The 
upper  branch  in  Figure  3.2  is  in  qualitative  agreement  with 
column  buckling  experiments. 
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4 . Weak  Imperfections. 


The  analysis  in  the  preceding  section  is  not  valid  if 
g°  = 0;  that  is,  if  the  imperfections  have  no  projection 
into  the  eigenfunction  of  the  bifurcation  point.  This  is 
equivalent  to  the  condition  (2.7)  for  the  general  nonlinear 
equation  analyzed  in  Section  2.  We  now  modify  the  analysis 
of  Section  3,  and  consider  imperfections  with  g°  = 0. 

For  g°  = 0,  the  four  outer  expansions  are  given  by, 
c.f.  (3.10)  and  (3.14), 

(4.1)  <t>  = U (x ; A ) 6 + 0(62)  , for  <|>0  = 0 , 

(4.2)  <f>  = ±Y  + U(x;A1)6  + 0(62)  , for  <f>0  = tV  , 


where  U is  defined  by 

(4.3)  U (x ; A ) = 


m 


J - A 'm 
m 


<t>  (x)  . 


Since  U is  bounded  as  A -*•  A^,  the  leading  term  of  the  outer 
expansion  (4.1)  is  bounded.  The  higher  order  terms  may  be 
unbounded  as  A ■+  A^ . 

An  analysis  of  the  inner  expansions  and  the  matching 
conditions,  which  we  do  not  present,  shows  that  6 = p. 
Furthermore  A ( y ) and  z(x;u)  have  the  power  series 
representations 


(4.4) 


A (y ) = A1  + 56  + 0(6Z)  , 
z(x;u)  = + 0 ( 6 2 ) 


There  is  only  one  inner  expansion  (4.4),  and  it  matches  with 
the  two  outer  expansions  (4.1)  corresponding  to  the 
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bifurcation  solution  = 0.  Then  the  composite  expansion 
of  (4.1)  and  (4.4)  is  (see  Section  2), 

(4.5)  $ = <t>*  B U (x ; X ) 6 + 0(62)  . 

If  g°  / 0,  then  the  outer  expansion  (4.1)  is  unbounded  as 
X Perturbations  of  bifurcations  occur  at  X ^ similar 

to  those  at  X^  when  g°  ^ 0. 

We  now  demonstrate  that  when  g°  = 0,  there  are  solu- 
tions of  the  perturbed  problem  (3.1)  that  bifurcate  from  the 
perturbed  solution  (4.5).  That  is  perturbations  of  bifurca- 
tions of  the  Elastica  with  g°  = 0 maintain  the  bifurcation. 
This  result  may  not  be  true  for  more  general  nonlinear  prob- 
lems. We  linearize  (3.1)  about  the  perturbed  solution 
4>*(x;X,6)  given  by  (4.5)  to  obtain 

(4.6)  v"  + X (cos  <f>*)v  = 0 , v'  (0)  = v'  (1)  = 0 , 

<v,v>  = 1 . 


The  bifurcation  points  of  <f>*  are  the  values  of  X(6)  for 
which  (4.6)  has  solutions  v(x;6)  ^ 0.  We  observe  that  for 
6 = 0,  = 0,  and  (4.6)  is  reduced  to  the  linearized  prob- 

lem of  the  Elastica.  Consequently,  we  have  X (0)  = X^  and 
v(x;0)  = ( x ) . Thus  for  6 / 0,  we  seek  solutions  of  (4.6) 

near  X = X^  in  the  form 


2 2 

X = X^  + A^6  + ~2~  6 + . . . , 

(4.7)  v 

v = + V;l6  + 62  + ...  . 


We  insert  (4.7)  into  (4.5)  and  (4.6)  to  obtain  sequence  of 
linear  problems  to  determine  the  coefficients  in  (4.7). 
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The  first  two  problems  are, 

(4.8a)  LVj  = + A1v1  = • vj(0)  = vj  (1)  = 0 , 

<V1  / 4*  1>  = 0 ; 

(4.8b)  Lv2  = -A241  + A 14>  x [U  (x;  A)  ] 2 - 2AlVl  , 

v2  (0)  = v^(l)  = 0 , <4>1,v2>  + <v1,v1>  = 0 . 

The  solvability  condition  applied  to  the  inhomogeneous  term 
in  (4.8a)  yields  = 0,  and  hence  (4.8a)  implies  that 
= 0.  Then  the  solvability  condition  applied  to  (4.8b) 
gives 

1 

(4.9)  A2  = ^ U2(x;X1)<t>^(x)dx  > 0 . 

0 

The  coefficient  A2  can  be  explicitly  evaluated  in  terms  of 
the  imperfection  coefficients  g°,  by  inserting  U from  (4.3) 
into  (4.9)  and  evaluating  the  resulting  integrals. 

Thus  the  bifurcation  point  A = A^  of  the  Elastica  theory 
is  shifted  to 

(4.10)  A (6)  = A1  + (A2/2)62  + 0(63)  , 

which  is  a bifurcation  point  of  the  perturbed  solution  (4.5). 
Since  A2  > 0,  it  is  shifted  to  the  right  of  A^.  The  per- 
turbed solution  that  bifurcates  from  (4.10)  can  be  obtained 
by  the  standard  perturbation  method  in  a new  amplitude 
parameter.  Equivalently  it  can  be  obtained  by  an  appro- 
priate analysis  of  the  outer  expansion  (4.2). 
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5 . Forced  Oscillations  as  a Singular  Perturbation 
of  Bifurcation. 

The  forced  nonlinear  oscillations  of  many  one-degree- 
of-freedom  systems  are  described  by 

(5.1)  X2y"  + y + f(y)  - 6g(t)  = 0 , 

where  fig  (t)  is  the  forcing  function,  and  the  restoring  force 
f (y)  is  a nonlinear  function  that  satisfies 

(5.2)  f (0)  = f ' (0)  =0  . 

The  amplitude  fi  of  the  forcing  function  is  assumed  to  be 
small,  g(t)  is  a prescribed  periodic  function  of  period  2tt, 
t is  a scaled  time  variable,  and  X is  the  forcing  frequency 
in  the  original  (unsealed)  time.  The  forcing  function  is 
treated  as  an  imperfection  of  the  bifurcation  problem 

(5.3)  X2y " + y + f(y)  = 0 , 

which  describes  the  free,  nonlinear  oscillations  of  the 
system.  We  apply  the  method  outlined  in  Section  2 to  obtain 
periodic  solutions  of  (5.1)  of  period  2v  that  are  near 
periodic  solutions  of  period  2ir  of  (5.3).  Thus  we  solve 
(5.1)  and  (5.3)  subject  to  the  periodicity  conditions, 

(5.4)  y (0 ) = y (2n ) , y'  (0)  = y'  (2u)  = 0 . 

For  simplicity,  we  shall  assume  that  6^0. 

We  observe  that  y = 0 is  a periodic  solution  of  (5.3) 
for  all  values  of  X.  Other  periodic  solutions  of  period  2ir 
of  (5.3)  branch  from  y = 0.  They  are  obtained  by  standard 
perturbation  methods  as 
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y 

(5.5) 

A 


2 f"  (0|  -j 

e cos  t + e — g — -[-3  + 2 cos  t + cos  2t]  + 0(e  ) 
1 + A ( 2 j e 2 + 0 (e3) 


where  the  small  parameter  e is  defined  by  e = y(0),  and  ^(2) 
is  defined  by 

(5.6)  A(2)  = ^ {-5 [f " (0) ] 2 + 3f (0) } . 


The  bifurcation  is  supercritical  (subcritical)  from  the  bi- 
furcation point  A = 1 if  ^2)  > 0 (<  0)  . If  ^2)  = then 
higher  order  terms  must  be  calculated  to  determine  the  di- 
rection of  bifurcation.  For  subcritical  bifurcation,  the 
frequency  A of  the  free  oscillations  decreases  with  ampli- 
tude, i.e.  the  nonlinearity  is  "soft". 

There  are  four  bifurcation  sub-branches  yg  emanating 

from  the  bifurcation  point.  There  is  an  outer  expansion 

corresponding  to  each  sub-branch.  They  are  obtained  by 

applying  the  method  discussed  in  Sections  2 and  3.  If  we 

assume  that  the  forcing  function  is  given  by, 

“>  2n 


■s 


(5.7)  g(t)  = / , g^cosmt  , 


m 


= i j g(t)cosmtdt  , 
0 


then  the  outer  expansions  are  given  by 


(5.8) 


y = 


T 


m 


TT 

m=l  1 - A m 


cosmt 


6 + 0(6Z) 


for  Yq  - 0#  and  by 

(5.9)  y = e cos  t + 0(e2) 


I-* 

L—  ( 2 ) 1 


cos  t + 0(1) 


6 + 0(6) 


for  yQ  given  by  (5.5).  Thus  the  outer  expansions  are  un- 
bounded at  A = 1 if  g^  / 0.  We  shall  assume  that  g^  ^ 0. 


I 

< 
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If  = 0,  an  analysis  analogous  to  Section  4 is  required. 

Similarly,  it  can  be  shown  that  the  inner  expansions 
are  given  by 

(5.10)  6 = w3  , X = 1 + £62/3  + 0(6)  , 

z = (A(Ocost)61/3  + 0(62/3)  , 

where  A(£)  is  a real  root  of  the  cubic 

3 4 F 2gl 

(5.11)  AJ  - A - t — = 0 . 

A(2)  A (2 ) 

The  matching  conditions  then  determine  how  to  connect  the 
inner  and  outer  expansions.  The  resulting  uniform  expansions 
are  sketched  in  Figure  5.1  for  subcritical  bifurcation  with 

g^  < 0.  The  singular  point  X = 1,  y = 0 of  the  free  oscilla- 

. ' . 2/3 

tions  is  shifted  to  the  singular  point  at  X = 1 + £q6  ' 

+ 0(6)  of  the  perturbed  problem. 
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1 . Introduction 

In  this  article  we  examine  both  the  local  and  global 
behavior  of  bifurcating  solutions  of  boundary  value  problems 
for  the  quasilinear  systems  of  ordinary  differential  equa- 
tions that  describe  the  equilibrium  states  of  nonlinearly 
elastic  rods  and  shells.  We  are  particularly  interested  in 
examining  the  critical  role  of  material  behavior  in  deter- 
mining the  nature  of  solutions. 

We  limit  our  attention  to  geometrically  exact  theories 
of  nonlinearly  elastic  structures . To  explain  what  is  meant 
by  such  theories  and  to  distinguish  them  from  the  far  more 
intensively  cultivated  nonlinear  technical  theories  of  elas- 
tic structures,  it  is  first  worthwhile  to  trace  the  high- 
lights of  the  historical  development  of  theories  of  elastic 
structures . 

The  elastica  is  the  simplest  geometrically  exact  model 
for  the  planar  equilibrium  of  a flexible  rod.  The  configur- 
ation of  an  elastica  is  described  by  an  inextensible , smooth 
plane  curve.  The  theory  of  the  elastica,  which  was  develop- 
ed by  Jas.  Bernoulli,  D.  Bernoulli,  and  Euler  in  the  first 
half  of  the  eighteenth  century  (cf.  [59]),  rests  upon  the 
fundamental  law  that  the  resultant  force  and  moment  on  every 
piece  of  the  rod  must  be  zero  for  the  rod  to  be  in  equilib- 
rium and  upon  the  constitutive  assumption  that  the  bending 
moment  is  proportional  to  the  curvature.  If  the  only  loads 
applied  to  the  rod  are  compressive  terminal  forces  (which 
must  be  collinear  and  of  equal  magnitude  for  equilibrium) , 
then  the  equilibrium  of  the  elastica  is  governed  by  the 
semilinear  ordinary  differential  equation 

(1.1)  [B (s) 0 ' ] ' + p sin e = 0, 

where  s is  the  arc  length  parameter  of  plane  curve  speci- 
fying the  elastica' s configuration,  the  prime  denotes  dif- 
ferentiation with  respect  to  s,  0(s)  is  the  angle  between 
the  tangent  to  the  curve  at  s and  the  direction  of  the  ap- 
plied load,  p is  the  prescribed  magnitude  of  the  applied 
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load,  and  B is  the  prescribed  bending  stiffness  (B  > 0) . 
That  no  geometrical  approximations  are  made  in  the  formula- 
tion of  this  equation  is  responsible  for  the  presence  of  the 
nonlinear  function  sin  in  (1.1)  and  for  the  characteriza- 
tion of  (1.1)  as  geometrically  exact.  Although  (1.1)  is 
nonlinear,  it  does  not  describe  the  behavior  of  a nonlinear- 
ly  elastic  rod  because  the  constitutive  equation  M(s)  = 

B ( s ) 9 1 (s)  , which  gives  the  bending  moment  M as  a function 
of  the  curvature  0',  is  linear.  If  we  replace  this  rela- 
tion with  the  nonlinear  constitutive  equation  M(s)  = 
M(9’(s),s)  where  M is  prescribed,  then  corresponding  to 

(1.1)  we  have  the  quasilinear  equation 

(1.2)  [M  ( 9 ' , s ) ] 1 + p sin  0 = 0, 

which  describes  a nonlinearly  elastic  rod.  It  is  interest- 
ing to  observe  that  equations  like  (1.2)  were  not  studied 
in  mechanics  until  the  last  decade.  As  we  shall  show  in 
Section  2,  the  solutions  of  boundary  value  problems  for 
equations  like  (1.2)  may  differ  markedly  from  those  for 
(1.1)  . 

The  great  influence  of  the  elastica  theory  embodied  in 

(1.1)  may  be  attributed  to  several  factors: 

(i)  Euler's  demonstration  in  1744  of  the  tractability 

of  (1.1)  by  his  brilliant  qualitative  analysis  [27] 
of  all  the  solutions  of  this  equation  when  B = 
const . 

(ii)  The  ability  of  nineteenth  and  twentieth  century 
scientists  to  solve  boundary  value  problems  for 

(1.1)  with  B = const.  in  terms  of  elliptic  func- 
tions . 

(iii)  The  central  role  played  by  linearized  versions  of 

(1.1)  in  structural  engineering. 


In  1859 , Kirchhof f [36]  showed  how  to  generalize  (1.1) 
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to  account  for  the  combined  flexure  and  torsion  of  an  inex- 
tensible  rod  in  space.  The  resulting  theory  yields  a system 
of  equations  completely  analogous  to  those  governing  the 
motion  of  a heavy  rigid  body  with  one  point  fixed.  This 
theory,  called  Kirchhoff's  kinetic  analogy,  is  certainly 
tractable  when  it  reduces  to  any  of  the  classical  integrable 
cases.  The  most  important  integrable  case  corresponds  to 
that  in  which  the  cross-section  of  the  rod  has  two  axes  of 
symmetry.  In  this  case  the  governing  equations  may  be 
solved  in  terms  of  elliptic  functions.  Moreover  this  case 
(as  well  as  major  generalizations  of  it)  can  be  treated 
qualitatively  along  the  lines  laid  down  by  Euler  (cf.  [12]). 
Kirchhoff's  kinetic  analogy  is  geometrically  exact  as  is 
strongly  suggested  by  the  welter  of  trigonometric  functions 
that  appear  in  its  governing  equations. 

The  historical  development  of  nonlinear  plate  and  shell 
theories  has  not  been  as  felicitous  as  that  of  rod  theories 
up  to  the  creation  of  Kirchhoff's  kinetic  analogy. 

Kirchhoff  established  a satisfactory  linear  theory  of  plates 
begun  by  Navier.  The  development  of  a geometrically  exact 
theory  was  no  doubt  hampered  by  difficulties  of  describing 
surface  geometry  and  strain  that  do  not  arise  in  rod  theo- 
ries, by  the  inappropriateness  of  constraints  such  as  inex- 
tensibility for  plate  and  shell  theories,  and  by  a lack  of 
interest  occ  sioned  by  the  likelihood  thac  the  resulting 
nonlinear  equations  would  be  mathematically  intractable. 
(Until  recently,  this  pessimism  was  well  justified.) 

A popular  nonlinear  plate  theory  was  finally  developed 
by  v.  Karman  [32]  in  1910.  In  this  theory,  the  vertical 
deflection  w and  the  stress  function  <p  of  a plate  sub- 
ject to  a planar  load  on  its  edge  with  a load  parameter  X 
is  governed  by  the  semilinear  system  of  partial  differen- 
ial  equations 


A Aw  = 
A A<p  = 


(1.3) 


X [f  ,w]  + [ «p  ,w]  , 
- [w,w]. 
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where  AA  is  the  biharmonic  operator,  [u,v]  = u v + 

xx  yy 

~ 2u  v , anc*  f is  a given  function.  These  equa- 
tions  are  "derived"  by  introducing  into  the  standard  proce- 
dures of  mechanics  a number  of  geometric  approximations  that 
are  roughly  analogous  to  the  replacement  of  the  sin  in 
(1.1)  by  its  cubic  approximation.  Consequently  this  theory 
is  not  geometrically  exact  (as  is  suggested  by  the  absence 
of  trigonometrical  functions  from  (1.3)).  Moreover  (1.3)  is 
based  upon  a certain  "linear"  stress-strain  law.  For  these 
reasons,  this  theory  has  never  been  embedded  in  a simple, 
rational  (albeit  formal)  approximation  scheme.  Nevertheless 
the  appearance  of  the  familiar  biharmonic  and  Monge-Ampere 
operators  have  caused  this  equation  and  its  various  related 
shell  theories  to  attract  nearly  all  the  serious  mathemati- 
cal attention  bestowed  on  nonlinear  problems  of  plates  and 
shells  in  recent  years.  The  popularity  of  this  theory  is 
somewhat  paradoxical:  Its  inadequacy  as  a rational  theory 
of  mechanics  is  not  compensated  for  by  analytic  simplicity. 
Indeed  many  studies  of  (1.3)  have  demanded  considerable 
mathematical  innovation  as  well  as  technical  expertise.  (Cf. 
[19,21,62],  e.g.)  Because  of  the  geometrical  approximations 
made  in  equations  like  (1.3)  their  global  analyses  are  de- 
void of  physical  interest. 

During  the  first  half  of  the  twentieth  century  the 
problem  of  developing  nonlinear  rod  theories  that  account 
for  such  effects  as  extensibility  and  shear  in  addition  to 
flexure  has  not  resulted  in  refinements  of  the  beautiful 
theories  of  Euler  and  Kirchhoff,  but  rather  to  developments 
along  the  lines  laid  out  by  v.  Karman.  (A  survey  of  such 
theories  has  been  given  in  [26].) 

Advances  in  continuum  mechanics  since  the  Second  World 
War  have  led  to  the  development  in  the  last  few  years  of 
geometrically  exact  models  of  nonlinearly  elastic  rods, 
plates,  and  shells.  (Cf.  [3,8,44].  Like  (1.2)  these  models 
are  characterized  by  quasilinear  differential  equations  in- 
volving arbitrary  constitutive  functions.  Recent  advances 
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in  analysis  have  made  at  least  some  of  these  problems  trac- 
table (in  perhaps  a somewhat  different  sense  from  that 
understood  up  to  the  middle  of  the  twentieth  century.)  As 
we  shall  show,  the  underlying  physics  prevents  the  resulting 
mathematical  problems  from  being  readily  subsumed  under  the 
available  nonlinear  operator  theories. 

The  arbitrary  constitutive  functions  that  appear  in  the 
equations  for  nonlinearly  elastic  structures  (like  M in 
(1.2))  may  be  regarded  as  parameters  in  an  infinite  dimen- 
sional space.  (In  (1.1)  with  B = const.  there  is  but  one 
essential  parameter:  p/B.  Geometric  and  material  parame- 
ters have  been  suppressed  in  (1.3).)  By  allowing  the  con- 
stitutive functions  to  vary  within  physically  reasonable 
limits,  we  can  use  the  bifurcation  process  not  only  as  an 
end  in  itself  but  also  to  illuminate  the  nature  of  material 
behavior.  Indeed,  we  shall  show  that  solutions  of  problems 
for  nonlinearly  elastic  structures  may  well  exhibit  impor- 
tant new  effects,  which  more  than  compensate  for  the  analy- 
tic difficulties  presented  by  these  theories. 

2 . Planar  Buckling  of  Straight  Rods 

In  this  section  we  generalize  the  elastica  theory  to 
account  for  extension  and  shear  as  well  as  flexure  and  to 
permit  a large  class  of  nonlinear  constitutive  relations. 

We  show  that  the  richness  of  this  theory  is  reflected  in 
the  richness  of  the  modes  of  behavior  of  the  solutions. 

Let  (i,^,k)  be  an  orthonormal  basis  for  E2 3.  The 
configuration  of  a rod  undergoing  a planar  deformation  is 
given  by  the  pair  of  vector  functions 

(2.1)  s f [0,1]  I — ►r(s),  b (s)  ? span{i,^},  |b(s)|  = 1. 

(s  may  be  taken  as  the  (scaled)  arc  length  parameter  of 
the  line  of  centroids  of  the  rod  in  its  reference  configu- 
ration, r as  giving  the  deformed  configuration  of  this 
curve,  and  b as  characterizing  the  deformed  configuration 
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of  a cross-section.  See  Fig.  2.1.)  We  denote  differentia- 
tion with  respect  to  s by  a prime.  We  define  9 and  a 
by 

(2.2a)  a(s)  = cos  6 (s)i  + sin  9(s)j, 

(2.2b)  b(s)  = ~sin  9(s)i  + cos  9 (s)^. 

(This  9 gives  the  orientation  of  b;  it  need  not  be  the 
same  as  the  9 of  (1.1).)  We  define  strains  £,  and  g by 

( 2 .3)  r'  = (l+£ ) a + gb. 

We  require  that 

(2.4)  1 + E,  = r'-a  > 0 

~ ~ a&z 

so  that  jr' | > 0 and  so  that  b(s)  is  never  collinear 

with  r' (s) . (More  stringent  restrictions  are  geometrically 
appropriate.  Cf.  [8,10].  For  simplicity  we  do  not  describe 
them  here.)  The  strains  for  our  problem  are 

(2.5)  w = (£  »g  / M-  = 9 ' ) 9 E = (-1,»)  xJRxE. 

Let  n(s)  and  m(s)  denote  the  resultant  force  and 
resultant  couple  applied  to  [0,s]  by  (s,l).  If  the 
only  loads  applied  to  the  rod  are  terminal,  then  the  equi- 
librium equations  are 

(2.6a,b)  n'  = 0,  m'  + r'  * n = 

In  consonance  with  the  assumption  of  planarity  we  decompose 
n and  m thus: 

'V 

(2.7a,b)  n(s)  = S(s)a(s)  + H(s)b(s),  m(s)  = M(s)k. 

Then  (2.6a)  implies  that 

(2.8)  n(s)  = JQ  ( 0 ) = n(l)  = -X.i  + v;j, 

which  when  coupled  with  (2.7c)  yields 
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Figure  2.1 


(2. 

. 9a) 

3(s)  = 

-X  cos  9 (s) 

+ v sin  0 (s)  , 

(2. 

.9b) 

H ( S ) = 

X sin  9 (s) 

+ v cos  9 (s)  . 

The  substitution  of  (2.3),  (2.7),  (2.9)  into  (2.6b)  reduces 

the  latter  to 

(2.10)  M*  + (X  sin  9 + v cos  0)  (!+£;)  + (X  cos  0 - v sin  0)r)  = 0. 
We  introduce  the  constitutive  functions 

(2.11)  (w,s)  (Ex  [0,1]  | — -<■  (S (w,s) ,H(w,s) ,M(w,s) ) € OR3 

with  S,  H,  M in  C4(E*  [0,1]).  Then  our  constitutive 
equations  are 

(2. 12a,b)  S (s)  = S(w(s),s),  H(s)  = H(w(s),s), 

(2.12c)  M (s)  = M (w  (s)  , s)  . 

Denoting  partial  differentiation  by  subscripts,  we  require 
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that 


(2.13) 


/ s. 


M 


be  positive-definite.  This  condition  ensures  that  an  in- 
crease in  tensile  force  corresponds  to  an  increase  in  exten- 
sion, an  increase  in  M corresponds  to  an  increase  in  (j,  = 
0',  etc.  (This  condition  is  intimately  related  to  the 
strong  ellipticity  condition  of  the  three-dimensional 
theory,  cf.  [8].)  We  also  require  that  extreme  strains 
must  correspond  to  extreme  resultants  by 


(2.14a) 


3 (£  ,-n  , |i,  s) 


as  £ 
as  £ 


(2.14b)  H U #r| » |x»s) 


as  g 


(2.14c)  M(£,T),p.,s) 

We  also  assume  that 


as  p. 


(2.15) 


H(5,0,(i,s)  = 0,  M(5,T),0,s)  = 0. 


Relations  (2.14)  and  the  positive-definiteness  of  (2.13) 
imply  that  if  S,  H,  M,  s are  prescribed  in  ]R3  * [0,1], 
then  the  algebraic  equations 

( 2 . 16a , b)  3(C,"n,|i.,s)  = S,  H(£,q,n,s)  = H, 

(2.16c)  M (£  ,t)  , |i , s)  = M 

have  a unique  solution  (C  » M-)  that  we  denote  by 

(2. 17a, b)  5 = C(3,H,M,s)  , q = rf (S,H,M,s)  , 
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(2.17c)  m-  = ^ (S » H/M,  s) 

and  that  if  S,  H,  p.,  s are  prescribed,  then  (2.16a,b)  has 
a unique  solution  (£,q)  that  we  denote  by 

( 2 . 1 8a  , b)  £ = £(S,H,h,s),  q = q (S  , H , p.,  s)  . 

We  assume  that 

(2.19a)  £(S,H,|i,s) 

(2.19b)  q(S,H,n,s)  — ► ±»  as  H -»  +». 

(Cf.  (2.14).) 

Let  us  substitute  (2.12a,b)  into  (2.9).  Then  (2.18) 
implies  that  the  resulting  expressions  are  equivalent  to 

(2.20a) 

£ (s) 

= £(-Xcos0(s)  + vsin0(s),  Xsin0(s)+v  cos  0 (s)  , 0 ' (s)  , s) , 


(2.20b) 
r\  (s) 

= rj  (-X  cos  0 (s)  + v sin  0 (s)  , X sin  9 (s)  + v cos  0 (s)  , 0 ' (s)  , s)  . 
The  substitution  of  (2.12c)  and  (2.20)  into  (2.10)  yields 


[M(i,q,0'  ,s)  ] 1 + X [ ( l + £ ) sin  0 + q COS  0] 
(2.21)  . - . , 

+ v [ (l+£)  cos  0 - q sin  0] 


where  the  arguments  of  £ and  q are  those  listed  in 

(2.20) . 

We  examine  the  boundary  conditions 


(2.22) 


0(0) 


0 


0(1)  , 
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(2.23) 

(2.24) 

and  either 

(2.25) 

or 


r(O)*^  = 0, 

n (1) • i = -X, 


= 0 


(2.26)  n (1)  • i = v = 0. 

Note  that  v,  which  appears  in  (2.21),  is  not  prescribed 
by  the  boundary  conditions  ( 2 . 22) - ( 2 . 25 ) . (This  is  a 
"statically  indeterminate"  problem.)  To  use  (2.21)  in  this 
case,  we  note  that  (2.3),  (2.23),  (2.25)  imply  that 


(2.27)  y[0,X,v] 


1 

[ (l+£)  sin  6 + t)  cos  0]ds  = 0. 

0 


Now 


(2.28) 

. y [e,\,v]  = 

i 

{ [£^.sin0  + CrtCOsG]  sin0  + [ri  sin0  + g .,cos0]  cos0  }ds  > 0 

J0  i H “ H 

as  a consequence  of  the  positive-definiteness  of  (2.13). 
Relations  (2-19)  imply  that 

l (2.29)  y [0,X,v] — » ±«  as  v -+  ±». 

* 

Relations  (2.28),  (2.29)  support  a global  and  a local 
implicit  function  theorem  to  the  effect  that  (2.27)  has  a 
unique  continuously  differentiable  solution 

(2.30a)  (0,\)  € C1([0,1])  x [0,-)  I—  0(0, X] 


with 
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(2.30b)  0(0, X]  = 0 

by  (2.15).  Let  us  define  0=0  when  (2.26)  holds.  Then 
our  boundary  value  problem  consists  of  the  quasilinear, 
second-order,  ordinary-functional  differential  equation 
(2.21)  with  v replaced  by  v[0,X]  subject  to  (2.22). 
(Compare  this  with  (1.1).)  This  problem  has  the  trivial 
solution  pair  (0,\). 

We  wish  to  determine  to  what  extent  the  solution 
branches  of  our  boundary  value  problem  inherit  the  proper- 
ties of  the  eigenfunctions  of  the  linearization  of  this 
boundary  value  problem  about  the  trivial  solution.  We  ac- 
cordingly examine  this  linearization.  It  is  given  by 


(2.31a) 

[P(s,XH- 

] ’ + XQ(s,XH  = -Q(s 

jXJv^fOjX]^, 

(2.31b) 

i (0)  = o = t(i), 

where 

(2.31c) 

P ( s , X ) 

= M (U-X,0,0,S)  ,0,0 

n 

, s)  , 

(2. 31d) 

Q (s , X) 

= l+l (-X,0,0, s)  + 

XhH (-x , 0, 0,s)  , 

(2. 31e) 

v 0 [0 , X] ^ 

fl 

= - Q(s,XH  (s)  ds  / 

0 

/ f1  - 

' T|  (-X,0,0,s)  ds 

0 H 

if  (2.25) 

holds , 

(2. 31f ) 

v0  [0,XH 

=0  if  (2.26)  holds. 

Let  us  first  examine  (2.31a,b)  when  (2.31f)  holds. 

With  an  eye  toward  applying  the  Sturmian  theory  (cf.  [31], 
e.g.)  we  multiply  (2.31)  by  X a where  a > 0 and  note 
the  following  results. 

i)  If  £ = 0,  g = 0 so  that  the  rod  is  inextensible  and 
unshearable,  then  P is  independent  of  X and  Q = 1. 
(In  this  case  (2.31a)  reduces  to  the  linearization  of 
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(1.2)  about  0 = 0.)  The  Sturmian  theory  then  implies 
that  the  problem  has  eigenvalues  X^  < X^  < ...  with 
Xq  > 0 and  corresponding  eigenvectors  ^^,1^,... 
with  \|r  having  exactly  k simple  zeros  in  (0,1). 

ii)  If  the  rod  is  extensible,  then  the  three-dimensional 
interpretation  of  our  resultants  S,  H,  M (which  we 
do  not  describe  here)  implies  that 

(2.32)  \[1  + i (-X,0,0,s)  ]/P(s,X)  — 0 as  X - « , 

(2.33)  if  there  is  a number  A 2 0 such  that 

[X  ttP(s,X) ]x  5 0 for  X 2 A 2 0,  then  a 2 1, 

(2.34)  if  a 2 1,  then  X1  a [ 1 + £ (-X  , 0 , 0 , s ) ] — *■  0 as 
X -*  °°  • 

We  then  have  the  following  alternatives: 

a)  If  T]  = 0,  then  X^  aQ(s,X) — ► 0 and 

XQ  (s  , X ) /P  ( s , X ) — *•  0 as  X -»  ».  The  Sturmian 
theory  implies  that  the  eigenvalues  are  confined 
to  a bounded  interval  of  (0,°°).  Mild  restric- 
tions on  S prevent  the  existence  of  an  infinite 
number  of  eigenvalues  in  this  case.  When  such 
conditions  obtain,  there  are  but  a finite  number 
of  corresponding  eigenfunctions.  The  number  of 
nodes  of  these  eigenfunctions  does  not  correspond 
to  the  ordering  of  the  eigenvalues.  Thus  the 
relaxation  of  the  constraint  £ = 0 markedly 
• changes  the  character  of  the  spectrum  and  the 
nodal  structure  of  solutions. 

b)  If  g 0,  then  the  behavior  of  X1  aQ(s,X)  and 
XQ  (s , X) /P  (s,  X)  as  X -*■  « (and  hence  the  charac- 
ter of  the  spectrum  and  the  nodal  structure  of 
solutions)  devolves  upon  the  nature  of  the  shear 
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eigenvalue  corresponding  to  an  eigen- 
function with  n-1  interior  zeros. 


response  embodied  in  X t -*•  ri  (-X  , 0 , 0,  s)  . There  are 

H 

neither  theoretical  nor  experimental  results 
available  to  restrict  the  behavior  of  this  func- 
tion and  so  there  is  a rich  collection  of  physi- 
cally acceptable  possibilities  for  the  nodal 
structure  of  the  eigenfunctions.  This  variety  of 
response  is  readily  observed  in  the  constant  coef- 
ficient case.  The  eigenvalues  are  solutions  X 

2 2 

of  XQ{X)/P{X)  = n n for  n a positive  integer. 
(See  Fig.  2.2.)  To  each  solution  X of  this 
equation  (there  may  be  many  or  none)  there  corre- 
sponds just  one  eigenf unction  s l — *•  sin  nits.  We 
also  note  that  (2.31a,b)  may  have  negative  eigen- 
values. These  correspond  to  shear  instabilities, 
which  we  discuss  in  a more  general  setting  in 
Section  6. 


These  results  indicate  that  the  nature  of  the  collec- 
tion of  solution  branches  for  our  general  quasilinear  prob- 
lem when  ( 2 . 3 1 f ) holds  may  be  markedly  different  from  that 
for  (1.1).  These  differences  may  become  more  pronounced 
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when  (2.31e)  holds.  To  gain  some  insight  into  the  source 
of  the  difficulties  in  this  case,  we  introduce  the  Priifer 
transformation 

(2.35a,b)  \|e  = p sin  co  + £,  X aP\|c ' = p cos  u, 

,1 

Q ( s , X ) \|c  ( s ) ds 

0 

r1  * 

X rj  (-X,0,0,s)  ds 
0 H 

1 

Q(s,X)p(s)  sin  ails)  ds 

- 0 

1 

[1  + £(-X,0,0,s)  ] ds 

0 

if  (2.31e)  holds  and  £ = 0 if  (2.31f)  holds.  Then 
(2.31a,b)  is  equivalent  to 

(2.36a)  p1  = (XaP  ■*"  - X aQ)  p sin  co  cos  co , 

(2.36b)  co'  = XaP  cos^  co  + X^  aQ  sin^  co, 

(2.36c)  p(0)  sinco(O)  = ~K  = p(l)  sinco(l). 

By  analyzing  (2.36a,b)  in  the  usual  way  (cf.  [23],  e.g.)  we 
find  that  phase-plane  trajectories  of  (2.31a)  may  have  the 
general  form  indicated  in  Fig.  2.3.  Candidates  for  solu- 
tions of  (2.31a,b)  correspond  to  trajectories  beginning  and 
ending  on  the  line  jc  = 0.  Note  that  the  number  of  zeros 
of  \|c  on  the  trajectory  AF,  which  is  four,  does  not 
correspond  to  the  number  of  zeros  of  jr ' , which  is  seven, 
or  the  number  of  zeros  of  \|c  - £,  which  is  eight.  More- 
over as  X changes  so  does  the  shape  of  the  trajectory 
and,  in  particular,  the  value  of  Thus  it  can  well 


where 


£ = 


(2.35c) 
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Figure  2.3.  Typical  trajectory  of  (2. 'la) 

happen  that  the  number  of  interior  zeros  of  \|z  on  AF  can 
change,  although  the  number  of  interior  zeros  of  and 

t - £ would  not  change.  Now  consider  the  boundary  value 
problem  in  which  (2.31b)  is  replaced  by  the  conditions 
\|z'  (0)  = 0 = \|r'  (1).  Then  candidates  for  solutions  corre- 
spond to  trajectories  beginning  and  ending  on  the  jz-axis. 
Our  same  analysis  indicates  that  the  zeros  of  jz  are 
singularly  inappropriate  for  the  characterization  of  eigen- 
functions. This  difficulty  becomes  even  more  pronounced  if 
(2.31b)  is  replaced  by  mixed  conditions.  Note  that  these 
difficulties  arise  solely  from  the  fact  that  K need  not 
be  zero.  Indeed,  the  collection  of  eigenfunctions  of  such 
problems  can  nearly  be  characterized  by  collections  of  J- 
tuples  of  integers,  each  integer  representing  the  number  of 
zeros  of  a function  such  as  \|z , \|z ' , \|z that  is 
associated  with  the  eigenfunction  \| /.  (When  mixed  boundary 
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conditions  are  used,  this  list  of  associated  functions 
should  be  supplemented  with  appropriate  linear  combinations 
of  \|f  and  \|('  that  are  related  to  the  boundary  condi- 
tions.) We  call  each  such  J-tuple  a label  for  the  eigen- 
function. 

Let  us  now  return  to  our  nonlinear  boundary  value  prob- 
lem. The  global  theory  of  bifurcation  introduced  by 
Crandall  & Rabinowitz  [25]  and  further  developed  by 
Rabinowitz  (cf.  [49-52])  may  be  applied  to  our  problem. 

This  theory  implies  that  if  X is  a nonnegative  eigenvalue 
of  (2.31a,b)  of  odd  algebraic  multiplicity,  then  there  is  a 
"branch"  of  solution  pairs  (0,X)  € C^([0,1])X  K bifurcat- 

ing from  (0,X)  that  is  unbounded  in  C^([0,1])  * ]R  and/or 
returns  to  the  trivial  solution  branch  at  (0,;i),  where  p 
is  another  eigenvalue  of  (2.31a,b).  If  X is  a simple 
eigenvalue  of  (2.31a,b)  and  if  (6,X)  lies  near  (0,X)  on 
the  branch  through  (0,X),  then  6 behaves  like  the  eigen- 
function \|/  of  (2.31a,b)  corresponding  to  X.  In  particu- 
lar, the  numbers  of  simple  interior  nodes  of  9 and  of  9' 
or  M (£  ,g  , 8 ' , • ) agree  with  those  for  and  Now  if 

the  number  of  simple  interior  nodes  of  9 should  change, 
then  (2.22)  implies  that  9 must  have  a double  zero  at  some 
t ( [0,1].  A standard  uniqueness  theorem  then  implies  that 
0=0.  Thus  the  number  of  simple  interior  npdes  of  9 can 
only  change  at  the  trivial  solution.  To  examine  the  nodal 
structure  of  M,  we  first  set 


6 [ 9 , X ] = arctan(v  [9,X]/X) , y(s)  = '0  (s)  + 6, 


(2.37) 

For  X > 0, 
satisfies 


N [ 9 , X ] = (X2  + v [ 0 , X ] 2) 1/2  . 

6 f (-n/2 , n/2 ) . Then  (2.21)  implies  that 


Y 


(2.38a)  [M(£,g,y' ,s) ] ' + N[(l+£)  sin  y + g cos  y]  = 0, 

A A 

where  the  arguments  of  £ and  g are 
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(2.38b)  -N  cos  y,  N sin  y,  y',  s. 

If  M should  change  the  number  of  its  internal  nodes,  then 
(2.22)  and  the  positive-definiteness  of  (2.13)  imply  that 
either  0(0)  = G'(0)  = 0,  or  0(1)  = 0'(1)  = 0,  or  there 
is  a t € (0,1)  such  that  y' (t)  = M(t)  = M' (t)  = 0.  In 

the  first  two  cases,  our  previous  argument  shows  that  0 

must  equal  zero.  In  the  last  case  (2.37)  and  (2.38)  imply 
tnat 

(2.39a)  (l+£l  siny(t)  + r)cosy(t)  = 0 

where  the  arguments  of  % and  rj  are 

(2.39b)  -Ncosy(t),  Nsiny(t),  0,  t. 

A careful  analysis  of  (2.39)  in  conjunction  with  either 
(2.26)  or  (2.27)  yields  that  if  | 0 ( t ) | 5 n/2  then  0(t)  = 

0.  Since  0' (t)  =0,  we  get  that  0=0  as  before.  We 

may  thus  conclude  that  on  the  portion  of  the  branch  from 
( 0 , X ) to  its  first  contact  with  { (<p  , X ) : ||<p||  q = n/2} 

the  function  M has  the  same  nodal  structure  as  (Where 

| 0 | exceeds  n/2,  the  rod  is  in  tension  and  may  suffer  a 
shear  instability.  The  shear  instability  [11]  may  be  re- 
sponsible for  the  break-down  of  the  nodal  structure.  See 
Section  6.)  Similar  results  hold  for  other  functions  de- 
pending on  0.  (These  are  harder  to  prove.)  Thus  it  is 
possible  to  show  that  the  nodal  structure  of  a solution 
branch  through  (0,X),  which  is  characterized  by  a label, 
is  inherited  from  that  of  ^ and  is  preserved  in  some 
large  neighborhood  of  the  trivial  solution. 

A simple  perturbation  analysis  (cf.  [33]),  which  can 
be  readily  justified  (cf.  [34,54,60]),  shows  that  whether 
or  not  X increases  on  a branch  as  it  leaves  a bifurcation 
point  (0,xj  depends  on  the  first  three  derivatives  of 
w i— »S(w,s),  H(w,s),  M(w,s)  evaluated  at  the  trivial  sol- 

»V  O/  X/ 

ution.  Thus  the  nature  of  material  behavior  can  be  an 
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important  contributor  to  the  process  by  which  the  straight 
state  loses  its  stability.  (Cf.  [46].) 

Bibliographical  Remarks.  The  results  described  in  this  sec- 
tion are  based  upon  the  much  more  extensive  development  of 
[16].  Modern  treatments  of  bifurcation  problems  for  (1.1) 
under  statically  determinate  conditions  are  given  by  [24, 
25,34,40,53].  Bifurcation  problems  for  nonlinearly  elastic 
rods  having  governing  equation  intermediate  in  complexity 
between  (1.2)  and  (2.21),  (2.29)  have  been  treated  by  [29, 

46,47,55] . 

3 . Spatial  Buckling  of  Straight  Rods 

In  this  section  we  indicate  how  the  theory  described 
in  Section  2 can  be  extended  to  treat  the  spatial  buckling 
of  straight  rods  under  terminal  thrust  and  twist 
(Greenhill's  problem,  cf.  [42]).  These  nonlinearly  elastic 
rods  can  suffer  flexure,  torsion,  extension,  and  shear. 

In  place  of  (2.1),  we  characterize  the  configuration 
of  a rod  by  a triple  of  vector  functions 

(3.1)  s ( [0,1]  | * r (s)  , b.  (s)  , b_  (s)  t IE3 

with  b.  and  b„  forming  an  orthonormal  pair.  We  set 

b,  = b,  x b.  so  that  [b. ,b_,b.}  form  a basis  for  IE  . 

~i.  ~1 

We  may  represent  these  vectors  by  Euler  angles  0,  \|x,  <p 
(cf.  [42,  §253]).  We  define  the  strains  [y^u.,  k = 1,2,3} 
by 


= (ueVx£k- 

Here  and  below  we  sum  repeated  Latin  indices  over  the  range 
1,  2,  3.  The  form  of  (3.3)  reflects  the  orthonormality  of 
{ b^  > . We  require  that 

y,  = r' *b0  > 0 . 

3 ~ ~3 


(3.2) 

(3.3) 


r ' 


b’ 

~k 


(3.4) 
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(Cf.  (2.4).) 

If  the  only  loads  are  terminal,  then  the  equilibrium 
equations  are  given  by  (2.6),  which  we  rewrite: 

(3.5a,b)  n ’ = 0 , m ’ + r ' x n = 0 . 

These  equations  immediately  imply  that 

( 3 . 6a , b)  n (s)  =n(l),  m(s)-n(l)  =m(l)-n(l). 

We  set 


(3.7a,b)  n(s)  = nk(s)bk(s),  m(s)  = mk(s)bk(s). 

From  (3.6)  we  then  get 

(3.8)  n.  (s)  = n(l)*b.(s),  m (s) n (1 ) • b,  (s)  = m(l)-n(l). 

K ~ K ~ ~ ~ 

(The  first  three  equations  of  (3.8)  are  analogous  to  (2.9). 
The  right  sides  of  these  equations  depend  upon  the  Euler 
angles.)  By  substituting  (3.7)  into  (3.5),  we  get  two  more 
equations,  which  are  independent  of  (3.8): 


(3.9)  m^  = u2m1  - u^  + y^  - y1n2 

(3.10)  uk[m*  + ekemy^nm]  = 0. 

Here  is  the  alternating  symbol. 

We  introduce  constitutive  functions 

(3.11)  (y^  »u£  » s)  | ► nk(y4,ug,s),  mk(yg,ug 

so  that  our  constitutive  equations  are 

nk(s)  = nk (y^ (s) ,nt  (s) ,s) , 


(3.12) 


mk(s)  = mk (y^ (s) ,u^ (s) , s) . 


s) 


We  require  that  the  6x6  matrix 
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(3.13) 


(dnk/ayg)  (ank/au^) 


(amk/aye)  (amk/au^) 


be  positive  definite,  that  (n^,n2)  and  (m^ , ) each  be 
isotropic  2-vector  functions  of  the  pair  of  2-vectors 
(y1,y2)  and  (u^u  ),  that 

n2  = 0 if  and  only  if  y2  = 0, 

(3.14) 

m2  = C if  and  only  if  u2  = 0, 


and  that  these  constitutive  functions  satisfy  growth  condi- 
tions like  (2.14),  (2.19).  The  isotropy  condition  and 

(3.14)  imply  that  the  substitution  of  (3.12)  into  (3.9)  re- 
duces the  latter  to 


A 

(3.15)  m3 (y« (s) »u£ (s) , s)  * m3(l). 

This  equation  and  the  four  obtained  by  substituting  (3.12) 
into  (3.8)  represent  a collection  of  five  integrals  of  the 
governing  equations.  The  positive-definiteness  of  (3.13) 
and  the  growth  conditions  justify  a global  implicit  function 
theorem  to  the  effect  that  these  five  equations  can  be 
solved  for  the  strains  (yk)  and  two  Euler  angles  \|r , cp 
in  terms  of  the  third  Euler  angle  8 (the  nutation  angle) . 
The  substitution  of  these  solutions  into  (3.10)  reduce  it 
to  a single  second-order,  quasilinear  ordinary  differential 
equation  for  8,  depending  upon  the  parameters  n(l)  and 
m(l).  This  equation  is  a complicated  generalization  of 
(2.21).  After  boundary  conditions  are  imposed,  we  can  study 
our  buckling  problem  by  imitating  the  development  of  Sec- 
tion 2 that  follows  (2.26).  Not  only  does  this  buckling 
problem  differ  from  that  of  Section  2 in  the  complexity  of 
the  governing  equations,  but  it  also  has  two  eigenvalue 
parameters,  the  thrust  and  twisting  moment,  in  place  of  X, 
and  it  has  up  to  four  statically  indeterminate  reactions  in 
place  of  v . The  presence  of  two  eigenvalue  parameters 
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means  that  solutions  branch  off  in  sheets  from  the  eigen- 
curves  of  the  linearized  problems.  The  theory  of  Crandall 
and  Rabinowitz  can  be  modified  to  show  that  these  sheets 
may  be  characterized  by  their  nodal  structure. 

Bibliographic  Remarks.  The  formulation  of  this  section  is 
based  on  [7,12].  Also  see  [42].  The  full  analysis  of  the 
global  behavior  of  buckled  states  is  given  in  [13].  A 
buckling  problem  for  Kirchhoff's  theory  is  studied  by  [39], 
which  shows  the  existence  of  some  interesting  secondary  bi- 
furcations. It  should  be  noted  that  the  presence  of  the 
twisting  makes  the  connection  between  the  bifurcation  anal- 
ysis and  stability  of  motion  more  tenuous  than  usual  because 
the  twisting  couple  need  not  be  conservative  (cf.  [22,37, 
48])  . 


There  is  a related  problem  that  leads  to  a somewhat 
more  complicated  set  of  equations  than  those  of  this  sec- 
tion: Find  all  the  travelling  waves  that  can  be  sustained 

in  a homogeneous  infinite  rod  with  hyperelastic  constitu- 
tive equations  of  the  form  (3.12).  The  homogeneity  and  the 
hyperelasticity  imply  that  there  is  a function 
(ye,u^)  I — » W(y^,ue)  such  that 

(3.16)  nk  = dW/ayk,  mk  = aW/auk< 

If  the  rod  is  doubly-symmetric , then  the  equations  of  motion 
have  the  form 


(3.17)  n = Ar  . , m +r  *n  = J(b.  *b,  + b, * b_  ) , 

~s  ~tt  ~s  ~ ~1  ~lfc  -2  ~2fc  t 

where  A and  J are  given  positive  numbers.  In  a travel- 
ling wave  all  the  fields  are  taken  to  depend  on  s,  t 
through  the  combination  K = s-ct.  Denoting  derivatives 
with  respect  to  K by  a prime,  one  can  show  that  (3.16)  - 

(3.18)  imply  that 


(3.18) 


c2Ar 1 


a (const. ) , 
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(3.20) 

(3.21) 


m^  - 2c  Ju^  = p (const.), 

% + Uk  - W - 4 [A*k*k  + J(UH+2U3)] 

= h (const. ) . 


Thus  the  problem  is  "completely  integrable"  and  would  be 

reducible  to  the  study  of  phase-plane  diagrams  were  it  not 

. 2 

for  the  fact  that  the  terms  with  c can  destroy  the  mono- 
tonicity of  the  left  sides  of  (3. 18) - (3. 20) . Thus  the 
solvability  of  (3. 18) - (3 . 20)  depends  critically  upon  the 
size  of  c,  the  asymptotic  or  average  values  of  the  loads, 
the  size  of  solutions,  and  the  behavior  of  the  constitutive 
functions.  One  can  exploit  these  variables  to  prove  a var- 
iety of  existence  and  nonexistence  theorems.  The  qualita- 
tive behavior  may  be  determined  by  multiple-valued  phase- 
plane  trajectories.  Under  certain  critical  conditions  one 
can  show  that  there  are  pulses  of  the  form  of  Fig.  3.1  with 
the  number  of  bays  arbitrary!  This  travelling  wave  problem 
is  treated  in  [14]. 

4 . Buckling  of  Rotating  Rods 

In  this  section  we  merely  formulate  a particular  prob- 
lem that  describes  the  buckling  of  a rotating  rod  and  indi- 
cate some  of  the  interesting  effects  that  can  arise.  We 
employ  the  theory  of  Section  3.  If  the  rod  is  rotating 


c 
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with  constant  angular  velocity  co  about  an  axis  with  direc- 
tion k passing  through  the  origin  0 of  E3,  then  the 
acceleration  term  is  equivalent  to  a centrifugal  body  force 

(4.1)  f(r(s),s)  = p(s)to2{r(s)  - [r(s)-k]k). 

'v  'W  1 r\j  y\y 

Here  p(s)  is  the  mass  per  unit  length  at  s in  the  ref- 
erence configuration.  We  shall  assume  that  p = const.  In 
place  of  (3.5),  we  have 

(4.2a,b)  n’  + f (r)  =0,  m1  + r'  x n = 0. 

/\y  y\y  rv  /V  y\. 

Let  us  suppose  that  the  reference  configuration  of  the 
rod  is  given  by 

(4.3)  r (s)  = si,  b (s)  = j,  b~(s)  = k,  s(  [1-L,1]  . 

»\y  ^ <v  I i<y  rvy  ^ r-j 

Here  is  an  orthonormal  basis,  s is  the  arc 

length  parameter  of  the  reference  axis,  and  L is  the 
length  of  this  axis  in  the  reference  configuration. 

We  suppose  that  the  end  s = 1 of  the  rod  is  welded 
to  a rigid  ring  that  is  constrained  to  lie  in  a plane  per- 
pendicular to  the  k-axis  and  to  rotate  about  it  with  con- 
stant angular  velocity  to.  We  assume  that  (i.,^)  are 
fixed  to  this  ring.  We  assume  that  the  end  s = 1 - L is 
free.  Then  our  boundary  conditions  are 

(4.4)  r(l)  = i,  b^l)  = X,  b2(l)  = k, 

(4.5)  n(l-L)  = 0,  m ( 1-L)  = 0. 

(There  are  but  six  scalar  equations  in  (4.4)  because  b^ 
and  b2  are  unit  vectors  and  because  the  problem  is  invar- 
iant under  translations  along  k. ) Our  boundary  value  prob- 
lem consists  of  the  equilibrium  equations  (4.2),  the 
"strain-displacement"  equations  (3.2),  (3.3),  the  constitu- 
tive equations  (3.12),  and  the  boundary  conditions  (4.4), 

2 

(4.5) .  a)  plays  the  role  of  the  eigenvalue  parameter. 
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A trivial  solution  is  one  for  which  the  rod  remains 
unbent,  unsheared,  and  untwisted.  As  is  typical  for  geo- 
metrically exact  theories  of  nonlinear  elasticity,  a triv- 
ial solution  may  not  be  simple:  In  the  trivial  state  r 
has  the  form 

(4.6)  r(s)  = r(s)i  with  r' (s)  = y~ 

rss  rsj  J 

so  that  (3.14)  implies  that  n has  the  form 

(4.7)  n (s)  = S (s) i . 

Moreover,  in  this  state 

(4.8)  u^  = u^  = u3  = 0 so  that  m^  = m2  = n>3  = 0. 

Thus  the  boundary  value  problem  for  the  trivial  solution 
reduces  to 

(4.9)  3 (r ' ) ' + pco2r  = 0,  si  (1-L,1), 

(4 . 10a , b)  r (1)  = 1,  £(r'(l-L))  = 0. 

Here  3(y3)  * n3 (0 , 0 ,y3 , 0 , 0, 0) . 

That  3 is  a strictly  monotone  function  from  (0,-) 
onto  (--,-)  and  that  (4.9)  is  autonomous  give  us  powerful 
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machinery  with  which  to  attack  (4.9),  (4.10).  The  analysis, 

which  we  omit,  is  nevertheless  tricky  because  the  problem 

2 

has  two  essential  scalar  parameters  to  and  L,  as  well 
as  the  infinite-dimensional  parameter  S.  The  results  of 
the  analysis  agree  with  those  of  the  following  heuristic 
argument:  If  L 5 1 then  there  will  be  a compressed  con- 

figuration with  r(l-L)  > 0.  The  existence  of  this  solu- 
tion is  a consequence  of  the  requirement  that  S(y3>\  -- 
as  y3\0.  This  means  that  the  material  is  strong  enough 
to  resist  any  compression  due  to  the  centrifugal  force. 

There  might  also  be  a trivial  state  with  part  of  the  rod  in 

tension  if  the  deformed  length  exceeds  1.  This  configura-  ' 

2 ! 
tion  would  occur  if  to  can  be  so  chosen  so  that  the  cen- 
trifugal forces  can  balance  the  tensile  stresses.  In  par- 
ticular, if  H is  superlinear  for  large  values  of  its 

argument,  then  one  can  expect  the  existence  of  solutions 

2 

with  r(l-L)  < 0 for  to  sufficiently  large,  whereas  if 

a ; 

S is  sublinear  for  large  values  of  its  argument,  then 

existence  of  such  solutions  could  only  be  expected  for 
2 

small  to  . These  same  considerations  give  rise  to  some- 
what different  results  when  L > 1.  In  short,  there  can  be 

2 

zero,  one,  or  more  trivial  solutions  for  each  value  of  to  , 

A 

with  this  number  depending  on  L and  S. 

Local  bifurcation  analyses  for  the  full  system  of 
equations  can  be  carried  out  in  the  neighborhood  of  eigen- 
values of  the  linearizations  about  each  trivial  branch, 
especially  the  trivial  branch  containing  the  completely 
unstrained  state  for  to  = 0. . Although  the  nonlinear  equa- 
tions do  not  reduce  to  a second  order  ordinary  differential 
equation,  there  is  evidence  suggesting  that  a global  quali- 
tative analysis  along  the  lines  of  Sections  2 and  3 may 
nevertheless  be  possible. 

The  strange  behavior  of  the  trivial  solutions  is 
reflected  in  that  of  bifurcated  states.  This  behavior  is  a 
manifestation  of  the  coercivity  problem  caused  by  the  cen- 
trifugal force:  The  centrifugal  force  behaves  like  a 
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linear  functional  of  the  strains.  Coercivity  conditions 
like  (2.14a)  show  that  the  rod  can  always  resist  large  com- 
pressive stresses  but  might  fail  to  resist  large  tensile 
stresses  if  the  constitutive  functions  are  not  ultimately 
superlinear.  When  (3.16)  holds,  the  problem  is  variational. 
If 


U 

(4.11) 

V 


(l  " 

1 1-L 


(Yk  (s) ,uk  s) ) ds, 


p|r(s)  -r(s)-kk|  ds. 


1-L 


f'J  rsy  • 


then  the  governing  equations  are  equivalent  to  the  Euler- 

2 

Lagrange  equations  for  the  functional  U - co  v.  The  problem 
of  minimizing  this  functional  immediately  leads  to  diffi- 
culty with  coercivity  described  above.  The  coercivity  prob- 
lem evaporates  if  we  consider  the  problem  of  minimizing  U 
on  the  manifold  defined  by  V = const.  But  then  the  multi- 
plier rule  gives  us  equations  that  are  equivalent  to  the 

2 

governing  equations  except  that  the  parameter  oo  is  re- 
placed by  a multiplier  X.  The  difficulty  is  thus  shifted 
to  that  of  verifying  that  X is  positive.  Similar  remarks 
apply  to  the  problem  of  minimizing  or  maximizing  V on  the 
manifold  defined  by  U = const.  (See  [3,9]  for  technical 
details  on  the  variational  problems.)  These  difficulties 
are  just  like  those  for  the  travelling  wave  problem  de- 
scribed at  the  end  of  Section  3 and  are  closely  related  to 
those  that  can  arise  for  the  problems  of  Section  5. 

Bibliographic  Remarks.  This  and  other  problems  for  rotat- 
ing bar$  are  being  investigated  in  [15] . A problem  for  a 
rotating  elastica  was  analyzed  in  [45).  Since  the  elastica 
is  inextensible , it  cannot  suffer  many  of  the  effects  de- 
scribed in  this  section.  These  problems  may  be  regarded  as 
successors  to  problems  for  rotating  strings.  The  results 
of  [38]  constitute  one  of  the  most  beautiful  global 
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analyses  in  the  applications  of  bifurcation  theory.  In  this 
connection,  see  [56]. 


5.  Buckling  of  Rings  and  Arches 

We  consider  the  planar  buckling  of  a nonlinearly  elas- 
tic ring  under  hydrostatic  pressure.  We  employ  the  same 
theory  as  that  used  in  Section  2.  The  hydrostatic  pressure 
has  intensity  p per  unit  actual  length  of  the  deformed 
ring.  Then  in  place  of  (2.7),  (2.9)  and  (2.10),  we  have 

the  scalar  system  of  equilibrium  equations 


(5.1a)  S'  - H0 ' - pg  = 0, 

(5.1b)  H'  + H 0 ' + p(l+£)  = 0, 

(5.1c)  M'  + H(l+£)  - Eg  = 0. 


We  assume  that  the  reference  configuration  is  a uniform 
ring  of  radius  1.  Thus  s is  restricted  to  the  interval 
[0,2n]  and  the  constitutive  equations  (2.11)  are  indepen- 
dent of  s.  We  seek  solutions  of  (5.1),  (2.12)  for  which 

4 and  g have  period  2rr  and  for  which 

(5.2)  0(s+2n)  = 0 (s)  + 2n . 

We  set 

(5.3)  il  = e'  - 1 
so  that  (5.2)  is  equivalent  to 

f2n  - 

(5.4a,b)  ^.(s+2n)  = p.(s),  H-(s)  ds  = 0. 

1 0 

Rather  than  getting  a second-order  ordinary 
(-functional)  differential  equation  for  0 as  in  Sections  2 
and  3,  we  instead  obtain  something  equivalent  to  a second- 
order  equation  for  (i.  This  appears  to  be  a nonlinear  ana- 
log of  the  fact  that  classical  linear  theories  of  curved 
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rods  are  governed  by  systems  of  higher  order  than  those  for 
straight  rods.  From  (5.1)  and  (2.11)  we  get 

(5.5)  D(H2  + H2  - 2pM]  = 0, 

(5.6)  SDH  + gDH  + (1+^)DM  = 0, 


where 

(5.7) 


— + n ' — + ii ' — 

£ 95  " * aS 


Suppose  that  S and  M are  even  functions  of  g and 
that  H is  an  odd  function  of  g.  We  set 


(5.8) 


2co  = g . 


These  symmetry  results  enable  us  to  introduce  new  consti- 
tutive functions  for  S,  H/g,  M,  that  depend  on  co.  We 
substitute  these  into  (5.5)  and  (5.6)  and  then  divide  the 
resulting  equations  by  ia'.  We  obtain  a pair  of  ordinary 
differential  equations  for  £ and  u>  as  functions  of  p.. 
The  positive-definiteness  of  (2.13)  enables  us  to  solve 
these  equations  in  a suitably  large  neighborhood  of  p.  = 0 
(which  corresponds  to  a uniformly  compressed  circular  state) 
and  also  to  solve  (5.1),  (2.11)  for  as  a function  of 

£ , g,  fi . Combining  these  two  results  with  (5.8)  we  find 
that  |i  satisfies  a pair  of  equations  of  the  form 

(5.9)  IT'  = ±FQ  , 

where  F depends  on  3,  H,  M and  their  first  derivatives. 
The  two  signs  come  from  the  two  roots  g of  (5.8).  These 
equations  give  the  phase  plane  trajectories  for  p..  By 
studying  these  trajectories  in  conjunction  with  (5.4),  one 
can  prove  the  following  results:  Under  mild  constitutive 
restrictions  solutions  preserve  their  least  periods  inher- 
ited from  the  eigenfunctions  linearized  about  the  trivial 
solution  in  a large  neighborhood  of  the  trivial  solution. 
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In  this  neighborhood,  every  simple  configuration  of  the 
ring  must  have  at  least  two  axes  of  symmetry.  The  number 
of  axes  of  symmetry  are  likewise  preserved  in  this  neighbor- 
hood. 

We  note  that  the  proof  of  these  results  did  not  re- 
quire the  apparatus  of  Crandall  and  Rabinowitz  because  the 
periodicity  conditions  (5.4)  greatly  simplify  the  analysis. 
On  the  other  hand,  bifurcation  problems  for  circular  arches 
under  hydrostatic  pressure  would  require  an  analysis  along 
the  lines  of  that  of  Section  2.  In  place  of  (2.21),  we 
would  have  the  second  order  ordinary  differential  equation 
corresponding  to  (5.9). 

Bibliographic  Remarks.  The  results  of  this  section  are 
based  on  [1,5,6].  The  first  two  of  these  references  exam- 
ine unshearable  rods  in  detail,  [1]  giving  a local  bifurca- 
tion analysis.  Local  and  global  results  for  the  elastica 
case  are  developed  from  a modern  viewpoint  in  [57].  The 
contact  problem  for  a buckled  elastica  ring  is  treated  in 
[28].  The  global  qualitative  analysis  for  nonlinearly 
elastic  arches  is  still  open.  Such  problems  should  exhibit 
some  interesting  effects  that  depend  on  the  shallowness  of 
the  arches.  In  particular,  linearized  analyses  of  simpler 
engineering  models  (cf.  [58],  e.g.)  suggest  that  buckling 
is  initiated  in  an  asymmetric  mode,  although  the  stable 
state  of  large  deflection  is  symmetric.  These  factors  are 
closely  related  to  the  phenomenon  of  snapping- through . 


6 . Tensile  Instabilities  of  Rods 

In  Section  2 we  found  that  the  qualitative  behavior  of 
solution  branches  could  change  markedly  when  parts  of  the 
buckled  rod  are  in  tension.  (Cf.  the  paragraph  following 
(2.39).)  Moreover,  an  examination  of  the  linear  eigenvalue 
problem  (2.31)  shows  that  it  may  well  possess  negative 
eigenvalues.  In  this  section  we  subsume  the  study  of  such 
tensile  instabilities  for  the  theory  of  Section  2 under 
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that  for  a richer  theory  in  which  the  rod  can  also  suffer 
transverse  contraction.  We  again  define  the  configuration 
of  the  rod  by  the  two  planar  vector  functions  r and  b 
of  (2.1),  except  we  now  suspend  the  requirement  that  b be 
a unit  vector.  We  define 

(6.1)  1 + p(s)  5 | b ( s ) | , o(s)  = p' (s) . 

In  addition  to  (2.4)  we  demand  that 

(6.2)  1 + p > 0. 

The  strains  for  this  theory  are 

(6.3)  w = (£ ,g , p,a, p) . 

We  let  S,  H,  P,  Z , M denote  the  generalized  forces 
corresponding  to  £ , rj , p,  a,  p (according  to  a suitable 
principal  of  virtual  work.  S,  H,  M are  the  resultants  de- 
fined in  Section  2;  P and  Z can  be  defined  as  certain 
stress  integrals  [3,11]).  Then  the  equilibrium  equations 
consist  of  (2.6)  and 

(6.4)  Z'  - M ( 1+  p)  ~ 1 9 ' - P = 0. 

If  the  rod  is  homogeneous,  then  it  has  constitutive 
equations  of  the  form 

(6.5)  S (s)  = 3 ( w ( s ) ) , H ( s ) = H (w  (s)  ) , P(s)  =P(w(s)), 

Z (s)  = Z (w  (s)  ) , M ( s ) = M (w  ( S ) ) . 

The  analog  of  the  strong  ellipticity  condition  is  that 


I 


be  positive  definite  and 

(6.6b)  P > 0. 

P 

In  addition  to  (2.14),  we  require 

as 

as 

(6.7b)  £ (w)  — * +»  as 


(6.7a) 


P (w) 


that 

P -*• 
P -*• 


a -* 


The  requirements  that  it  be  no  easier  to  shear  or  bend  the 
rod  in  one  sense  than  in  the  opposite  sense  and  that  the 
same  effort  is  needed  to  produce  a as  to  produce  -a  is 
made  precise  by  the  following  conditions  in  which  the  argu- 


ments 

K and  p 

A 

are  suppressed: 

(6.8a) 

3 (g  ,o , p) 

= S(g,-o,-p)  = 

2 (-g,a. 

-p)  = 3 (~g  »-a, p) , 

(6.8b) 

H (g  , a , p) 

= H(g,-a,-p)  = 

-H (-g,a, 

-p)  = -H (-g ,-a, p) , 

A 

A 

A 

A 

(6.8c) 

P (g  , a , p) 

= P(g,-o,-p)  = 

P (~g , o , 

-p)  = P (-g , -o , p) , 

A 

A 

A 

(6 . 8d) 

Z (g  ,a,  p) 

= -t  (g ,-a,-p)  = 

Z (-g , a , 

-p)  = -Z (-g , -a , p) , 

A 

(6. 8e) 

M (g  , o , p) 

= -fl(g,-a,-p)  = 

-M (-g , a , 

-p)  = M (-g , -o , p) . 

We 

consider 

the  boundary  conditions 

r ( 0)  = 0, 


= (l+y)i  with  y > 0, 

r 


(6.9a) 


r (1) 
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(6.9b) 

2(0)  = 0, 

2d)  = 0, 

(6.9c) 

6(0)  = 0, 

6(1)  = 0. 

Condition  (6.9a)  prescribes  the  elongation  y,  which  serves 
as  the  eigenvalue  parameter  in  place  of  the  tensile  force 
L.  (It  is  technically  more  convenient  to  use  y instead  of 
i.)  The  remaining  two  conditions  ensure  that  there  are 
trivial  solutions  to  the  governing  equations. 

Our  boundary  value  problem  consists  of  (2.2)  with  b 
replaced  by  b/|b|,  (2.3),  the  three  scalar  equilibrium 

equations  implicit  in  (2.6),  the  equilibrium  equation  (6.4), 
the  constitutive  equation  (6.5),  and  the  boundary  conditions 

(6.9) .  By  using  (2.8),  one  can  reduce  the  equations  to  a 
fourth-order,  quasilinear  system  of  ordinary  differential 
equations.  Since  the  order  is  not  2,  it  would  be  very 
difficult  to  extend  to  this  problem  our  results  on  nodal 
structure  of  Section  2,  because  these  results  rely  on 
Sturmian  theory  and  uniqueness  theorems  suitable  only  for 
second  order  systems.  Consequently  we  limit  our  attention 
primarily  to  the  local  analysis  of  bifurcation. 

Using  (6.6b),  (6.7a),  we  can  readily  show  that  our 

boundary  value  problem  possesses  a trivial  solution 

(6.10)  £ = y , g = 0,  p = p(y),  6 = 0, 

where  p(y)  is  the  unique  solution  of  P(y,0,p,0,0)  = 0. 
This  solution  describes  an  elongated  rod  with  uniformly 
contracted  and  unsheared  cross-sections. 

We  study  our  boundary  value  problem  by  a perturbation 
scheme  [33]  that  converges  at  least  for  branches  emanat  ng 
from  simple  eigenvalues  of  the  problem  linearized  about 

(6.10) .  (The  convergence  may  be  proved  by  the  Poincare 

shooting  method  [34]  or  by  the  method  of  Lyapunov  and 
Schmidt  [54,60].)  We  introduce  an  amplitude  parameter  a 
and  assume  that  on  a solution  branch  y and  the  geometric 
variables  have  representations  of  the  form 
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Y o + Yj_a  + Y2a2/2!  + Y3a3/3!  + 0(a4), 

+ C1(s)a  + ^2(s)a2/2!  + S3<s)a3/3! 

+ 0 (a4 ) , etc . 

We  substitute  (6.11)  into  our  boundary  value  problem.  We 
get  conditions  for  Y() ' Y1 ' Y2 ' * * ’ ' ^1 ' ^2 ' ‘ ' etc.,  by  repeat- 

edly differentiating  these  equations  with  respect  to  a 
and  then  setting  a = 0.  The  conditions  of  (6.8)  reduce 
the  determination  of  these  unknowns  to  a sequence  of  third- 
order  (rather  than  aixth-order)  linear  ordinary  differential 
equations  with  constant  coefficients.  These  equations  are 
readily  analyzed.  We  find  that  there  are  two  different 
kinds  of  bifurcation  processes  corresponding  to  two  differ- 
ent kinds  of  eigenvalues  of  the  linearized  problems.  Let 

(6.12)  P (y)  s S[s  - (l+y)H  ] / H M , 

T)  T)  p. 

where  the  constitutive  functions  are  evaluated  at  the  tri- 
vial solution  (6.10).  Then  one  set  of  eigenvalues  called 
eigenvalues  of  sheaf  type,  are  the  real,  non-zero  solutions 
y of  the  equations 

2 2 

(6.13a)  p(y)  = 4m  n for  m a positive  integer, 

(6.13b)  cot  (\fpW)/2)  = -2\/pTyTm  /(l+y)S 

with  the  constitutive  functions  evaluated  at  (6.10).  It  is 
not  hard  to  see  that  the  smallest  positive  root  y^  of 

((.13),  if  it  exists,  is  a root  of  (6.13b)  with  p(y  ) £ 

2 2 p 
(n  ,4n  ).  If  this  root  is  simple  and  distinct  from  the 

roots  of  (6.16)  below,  then  the  leading  terms  of  (6.11) 

corresponding  to  the  branch  emanating  from  y are  given 

by 
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(6.14) 


y^  = o. 


^(s)  = o. 


p,(s)  = o. 


(rj (s)  >(Pl  (s)  ) 


= const.  (1,-H  /H)  sin 


^p"(r  ) 


^pT 77) 


s sin 


(1-s). 


(Indeed,  if  Yp  is  any  simple  root  of  (6.13b),  then  (6.14) 
holds . ) Let 


(6.15) 


q(r>  - <P,S  -p  S )/H. z . 


f.-p  o' 

where  the  constitutive  functions  are  evaluated  at  (6.10). 
The  second  set  of  eigenvalues,  called  eigenvalues  of  neck- 
ing type , are  the  real,  non-zero  solutions  y of 

2 2 

(6.16)  q(r)  = n n for  n a positive  integer. 

Let  y be  the  smallest  positive  root  of  (6.16).  It  sat- 

q 2 

isfies  q(Yg)  = tt  . If  this  root  is  simple  and  distinct 
from  the  roots  of  (6.13),  then  on  the  branch  emanating  from 
Yg  the  leading  terms  of  (6.11)  are  given  by 


(6.17)  / 


YX  - 0 


U,  (s)  ,p.  (s)  ) = const.  (-S  /S  , 1 ) cos  nns, 
II  P K 


r|  1 (s)  =0, 


(Pj^  (s)  = 0. 


(The  same  formulae  apply  if  Yq  is  any  simple  root  of 
(6.16).) 

The  perturbation  analysis  shows  the  following  results: 

The  higher  order  terms  for  the  branch  emanating  from  Yp 

have  nonconstant  values  for  p^,  whereas  those  for  the 

branch  emanating  from  y have  tl  = cp.  = 0.  Thus  the 

CJ  K K 

development  of  a "shear  instability"  from  Yp  is  accompan- 
ied by  nonuniform  longitudinal  stretching  and  transverse 
contraction.  On  the  other  hand,  there  is  no  shearing 


108 


STUART  S.  ANTMAN 


Figure  6.1a.  The  undeformed  configuration  of  rod. 


Figure  6.1b.  The  shear  instability,  which  is  a 

deformed  configuration  on  the  branch 
emanating  from  y • 


Figure  6.1c.  The  necking  instablility , which  is  a 
deformed  configuration  on  the  branch 
emanating  from  y • 
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accompanying  the  initiation  of  the  necking  at  y . In  each 
case  the  value  of  and  in  particular  its  sign,  is 

determined  by  derivatives  up  to  order  3 of  the  constitu- 
tive functions  evaluated  at  the  trivial  solution.  From  the 
first  few  computed  terms  of  (6.11)  we  can  sketch  the  de- 
formed configurations  of  the  rods.  This  is  done  in  Fig. 
6.1.  These  shapes  are  like  those  of  the  photographs  given 
in  [43].  The  pure  necking  problem  can  also  be  given  a 
global  qualitative  analysis  [5]  under  arbitrary  boundary 
conditions . 

The  same  perturbation  results  enable  us  to  determine 
the  tensile  load  A = [Sa + Hb] • i as  a function  of  y. 

This  is  the  information  recorded  on  a tension  test.  We 


illustrate  the  typical  results  in  Fig.  6.2,  which  conform 
to  those  found  experimentally.  Note  that  the  positive  def- 
initeness of  (6.6a)  does  not  require  that  A corresponding 
to  the  trivial  solution  and  a fortiori  A corresponding  to 
a bifurcating  solution  to  be  a monotonical ly  increasing 
function  of  y.  The  elementary  theory  of  elastic-perf ectly 
plastic  materials  inspired  by  curves  like  Fig.  6.2  des- 
cribes these  instabilities  by  replacing  the  strictly  mono- 

A A 

tone  function  £ 3 (w)  by  one  in  which  = 0 on  an 

interval  of  the  form  [Y,-0  with  Y > 0.  (Y  corresponds 

to  v> 

If  the  material  is  hyperelastic,  then  the  problem  is 
variational.  By  using  the  theorems  of  [3,9],  we  can  show 
that  the  potential  energy  functional  has  a smooth  global 
minimizer  for  each  value  of  y.  It  can  be  shown  that  the 
second  variation  of  this  functional  at  the  trivial  solution 
is  not  positive-definite  for  y exceeding  the  smaller  of 


y and 
P 

in  this 


range . 


hus  the  trivial  solution  is  unstable  for 
This  notion  of  stability  can  be  made  pre- 


r 


cise  [ 22 ] . 


As  noted  above,  it  can  v/ell  happen  that  Y2  < This 

together  with  the  instability  results  strongly  suggest  that 
these  problems  will  exhibit  hysteresis  when  y is  slowly 
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Figure  6.2.  The  effect  of  bifurcation  on  the  load- 
extension  curve.  The  heavy  curve  cor- 
responds to  the  trivial  solution.  The 
curves  joining  it  at  the  heavy  dots 
correspond  to  the  bifurcating  branches. 

oscillated  in  a suitable  open  interval  containing 
[0,  min (y  »rq) ] . 

Thus  our  geometrically  exact  theory  of  nonlinearly 
elastic  rods  has  solutions  that  can  exhibit  some  of  the 
main  effects  of  plastic  rods:  shear  (or  yield-point)  in- 
stabilities, necking,  and  hysteresis. 

Bibliographic  Remarks.  The  results  described  here  are 
based  on  the  extensive  development  of  [4,5,11].  Paper  [4] 
gives  an  analytic  treatment  of  the  necking  problem  by  the 
Poincare  shooting  method,  paper  [5]  examines  the  global 
behavior  of  the  necking  problem  by  use  of  phase-plane  meth- 
ods. Paper  [11]  gives  a detailed  treatment  of  the  material 
described  here  except  that  it  replaces  (6.9c)  with  the  con- 
ditions M ( 0 ) = M(l)  = 0.  (By  the  positive-definiteness  of 
(6.6a)  and  by  (6.8),  these  are  equivalent  to  0'(O)  = 0'(1) 
= 0.) 
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7 . Buckling  of  Circular  Plates 

We  now  examine  the  axisymmetric  buckling  of  circular 
plates  under  a compressive  edge  thrust.  Our  approach  bears 
a marked  resemblance  to  that  of  Section  2,  but  the  analytic 
problems  are  strikingly  different. 

Let  (i,j,k)  be  an  orthonormal  basis  for  E3  and  let 
(s,<p)  be  polar  coordinates  for  the  (i,j)-plane.  Set 

(7.1)  e.  ((p)  = cos  ipi  + sin  tp j , e (ip)  = -sin  ipi  + cos  <p j . 

Ay  I Ay  Ay  Ay  2 Ay  Ay 

The  axisymmetric  configuration  of  a circular  plate  that  can 
suffer  flexure,  extension,  and  shear  is  determined  by  the 
pair  of  vector  functions 


(7.2a) 

(s,<p)  t (0,1]  1 

r(s,<p),  b(s,<p),  |b(s,<p)  | = 1 

with 

(7. 2b, c) 

r (s,(p)  • e_  (<p) 

Ay  Ay  ^ 

= 0,  b(s,tp)  -e  ( tp ) = 0, 

Ay  Ay  ^ 

(7 . 2d. e) 

r ( 0 , <p ) («p ) 

= 0,  b(0,cp)=k. 

A/  AJ 

We  set 

(7.3a) 

a(s,<p)  = cos 

6(s)e.(<p)  + sin  9(s)k, 

Ay  ^ Ay 

(7. 3b, c) 

b(s,ip)  = -sin 

efsje^ttp)  + cos  6 (s)  k,  9(0)  = 0, 

(7 . 4a , b) 

r (s  ,<p ) -e. 

(.p)  = r (s)  , r (0)  = 0, 

(7.5) 

ar 

(8'^  " 

[1+S  (s)  ]a(s,cp)  + h (s)b(s,tp)  , 

t 

(7.6) 

1 + T 

(s)  = r(s)/s, 

(7.7) 

O 

(s)  = sin  0 (s)  / s , 

(7.8) 

(s)  = 0 ' (s)  . 

The  strains  for  our  problem  are 
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(7-9)  w = (^,4,11,0,^)  . 

The  interpretation  of  4,  f],  p.  parallels  that  of  Section 
2:  4 and  g measure  extension  and  shear  in  the  radial 
direction;  u measures  the  flexure  about  e„.  t measures 
the  circumferential  extension,  a (which  bears  no  relation 
to  the  a of  Section  6)  measures  flexure  about  e^.  We 
require  that 

(7.10)  1 + 4 > 0,  1+xiO. 


(More  sophisticated  conditions  are  appropriate.  See 

[8,10]  . ) 

Let 


(7.11)  n (Sq,<Pq) 


(so)S(so'(po)  + H(so)fe(so'(po) 


be  the  force  per  unit  reference  length  and  let 


(7.1 2) 


S (s0'V 


(s0)e2  (tPg) 


be  the  couple  per  unit  reference  length  of  { (s,<p)  : s = sQ} 
exerted  by  {(s,<p)  : s > sQ}  on  {(s,<p)  ; s 5 sQ}  at 

(sO'<J>o*  ' Let 

( 7 ' 1 3 } S2(so'lpo)  3 T(s0)S2(<p0) 

be  the  force  per  unit  reference  length  and  let 

(7.14)  rn2(s0,4>0)  5 - z (sQ)  cos  0 (Sg),^  (<pQ)  + m2  (Sg.Vg) -k  k 

be  the  couple  per  unit  reference  length  of 
{(s,cp):  (p  = <pg]  exerted  by  {(s,q>):(Pg<(p<lpg-t-TT}  on 
{(s,ip)  : ipg  - n 5 cp  < <p g > at  (Sg,cpg).  The  representations 
(7. 11)- (7.14)  embody  the  requirement  that  the  resultants  be 
axisymmetric.  Assume  that  the  only  external  load  applied 
to  the  body  consists  of  reactions  to  geometric  boundary 
conditions  at  the  edge  and  a normal  pressure  applied  to  the 
edge : 
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(7.15)  n1(l,<p)  = -Xg  (r  (1)  )e.  (<p)  . 

(We  permit  this  normal  pressure  to  have  a hydrostatic  char- 
acter by  permitting  its  amplitude  to  depend  on  r(l).)  The 
classical  versions  of  the  equilibrium  equations  have  the 


form 

rl 

(7.16a) 

SS  (s) 

= - [ xg  (r  (1)  ) + 

s 

T (t) 

dt] cos 

0 (s)  , 

rl 

(7.16b) 

sH  (s) 

= ( Xg  (r  (1 ) ) + 

L 

T (t) 

dt] sin 

0 (s)  , 

(7.17) 

<sM)  1 - 

2 cos  0 

+ s[\g(r(l))  + T(t)dt]  ( (l+£)sin0  + risinO]  = 0. 

' s 

Compare  this  system  with  (2.9),  (2.10).  There  is  no  verti- 
cal edge  reaction  v in  (7.16),  (7.17)  because  its  pres- 

ence would  be  inconsistent  with  axisymmetry  and  equilibrium. 
Thus  the  delicate  analytic  questions  arising  from  the  non- 
vanishing of  v in  Section  2 are  not  present  for  our  plate 
problem.  On  the  other  hand,  the  explicit  presence  of  the 
radial  coordinate  s and  the  replacement  of  the  scalar  X 
of  (2.9)  by  the  bracketed  expression  of  (7.16)  are  the 
source  of  considerable  analytic  difficulty  that  we  describe 
below. 

We  assume  that  the  material  of  the  plate  is  homogen- 
eously elastic  (implicit  in  which  is  the  notion  that  the 
plate  is  of  uniform  thickness) . We  introduce  constitutive 
functions 

(7.18)  W I — ► T ( W ) , S (w)  , H (w) , i (w) , M (w) 

such  that 

(7.19a, ...  ,e)  T(s)  = T (w  (s)  ) , . . . ,fl  (s)  = M(w(s)). 


We  require  that 
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^ and 


be  positive-definite, 
of  the  strong  ellipticity 
able  to  supplement  this 


plate- theoretic  analog 
It  is  not  unreason- 
the  requirement  that 


(7. 20a, b) 


This  is  the 
condition, 
condition  with 


be  positive-definite.  This  is  a restricted  plate-theoretic 
version  of  the  Coleman-Noll  inequality  [61].  We  finally 
require  that  the  constitutive  functions  meet  coercivity 
conditions  like  (2.14)  and  invariance  conditions  like  (6.8). 

We  choose  the  boundary  condition 

(7.22)  0 (1)  = 0. 

Thus  our  boundary  value  problem  consists  of  the  "strain- 
displacement"  relations  (7. 5) -(7. 8),  the  equilibrium  equa- 
tions (7.16),  (7.17),  the  constitutive  equations  (7.19), 

and  the  geometric  boundary  conditions  (7.3c),  (7.4b), 

(7.22)  . 

Our  plan  is  to  solve  the  system  consisting  of  (7.5), 
(7.6),  (7.16),  (7.19a,b,c)  for  i , g as  functions  of 

9 and  X and  then  substitute  the  resulting  expressions 
and  (7.19d,e)  into  (7.17)  to  get  a second  order  functional- 
differential  equation  for  0 to  which  we  can  apply  the 
theory  of  Crandall  & Rabinowitz  [25]  supported  by  the  full 
power  of  the  Sturmian  theory.  Thus  we  must  solve  the 
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system 

(7.23a) 

fl 

sS  (w  ( s ) ) 

- [Xg (r (1)  ) + j 

1 

s 

T (w  (t)  ) dt  ] cos  G(s ) , 

(7.23b) 

A 

fl 

A. 

sH  (w(s)  ) 

[Xg (r  (1)  ) + j 

s 

T (w(t) ) dt]  sin  0(s) , 

(7.24) 

r (s) 

= s [ 1 + T 

(s)  ] 

fl 

= r (1)  - 

I, 

{ [1+E,  (t)  ] cos  9 (t) 

+ q (t)  sin  9 (t)  } dt. 

subject  to  the  strict  inequalities  (7.10),  when  9 is  a 
given  function  in  C1 ( (0,1) ) n C°([0,1])  satisfying  (7.3c), 
(7.22).  We  do  not  employ  the  alternative  integral  repre- 
sentation 

(7.25)  r(s)  = f {[l  + £(t)]cos9(t)+q(t)sin9(t)}dt 
1 0 

that  incorportates  (7.46)  for  (7.24)  because  it  presumes 
that  r is  absolutely  continuous  on  [0,1],  while  the 
singular  character  of  (7.23)  puts  such  a presumption  in 
doubt.  The  system  (7.23),  (7.24)  generalizes  that  of  Sec- 

tion 2 leading  the  solution  (2.20).  The  problem  of  Section 
2 is  just  a pair  of  algebraic  equations  that  are  resolved 
by  a global  implicit  function  theorem  relying  on  the  mono- 
tonicity of  the  constitutive  functions.  In  contrast,  we 
are  now  confronted  by  a bona  fide  system  of  integral  equa- 
tions. Using  the  positive-definiteness  of  (7.20b)  or  (7.21) 
we  could  convert  (7.23),  (7.24)  to  a variety  of  equivalent 

forms . 

Our  first  inclination  in  attacking  (7.23)  would  be  to 
formulate  (7.23)  as  an  operator  equation  of  the  form 
(£,q,r)  = F(£,t),r)  on  the  Banach  space  of  continuous  func- 
tions and  then  employ  Schauder ' s fixed  point  theorem.  But 
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it  seems  impossible  to  choose  F so  that  it  both  meets  the 
requirements  of  Schauder's  theorem  and  is  compatible  with 
(7.10)  and  the  linearity  of  (7.24).  We  are  consequently 
led  to  an  alternative  formulation  of  the  problem  in  the 
context  of  montone  or  pseudo-monotone  operators  on  Sobolev 
spaces.  In  such  a setting  the  disadvantages  of  (7.25)  be- 
come more  pronounced.  We  accordingly  formulate  a suitable 
weak  form  of  the  equations  without  (7.4b).  Then  in  place 
of  (7.4b),  the  regularity  theory  produces  the  natural  bound- 
ary condition 

(7.26)  lim  s2(w(s) ) = 0, 

s->0 

which  is  instrumental  in  ultimately  showing  that  (7.4b) 
holds.  If  (7.20L)  is  positive-definite,  then  the  operator 
appearing  in  the  weak  equations  is  pseudo-monotone  (cf. 
[41]),  and  an  existence  theory  is  obtained  without  severe 
difficulty.  If  (7.21)  is  positive-definite,  this  operator 
is  strictly  monotone  and  we  readily  get  both  existence  and 
uniqueness  of  the  solution.  The  regularity  theory  away 
from  s = 0 follows  the  lines  of  [6].  (The  full  power  of 
[9]  is  not  required  here;  much  of  the  effort  of  [9]  was 
devoted  to  showing  that  minimizers  of  certain  variational 
problems  satisfy  the  weak  Euler  equations.)  One  can  then 
use  the  regularity  of  solutions  on  [0,1]  to  show  that  the 
solutions  are  in  fact  regular  on  [0,1]  by  a careful  use 
of  differential  inequalities  along  the  lines  of  the  proof 
of  Nagumo's  uniqueness  theorem  [30]. 

Thus  we  can  reduce  our  full  system  to  a single  second- 
order,  ordinary-functional  differential  equation  for  0 
when  (7.21)  is  positive-definite  and  to  a family  of  such 
equations  when  (7.20b)  is  positive-definite.  The  result- 
ing equations  are  susceptible  of  a full  analysis  along  the 
lines  indicated  in  Section  2.  The  linearization  of  the 
final  equation  is  a Bessel  equation.  A uniqueness  theorem 
of  Nagumo  type  ensures  that  the  nodal  structure  that  a 
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branch  inherits  from  the  linearization  is  preserved  along 
the  entire  branch.  The  full  analysis  shows  that  the  family 
of  solutions  of  the  nonlinear  equations  exhibit  a budget  of 
pathologies  just  like  the  problem  of  Section  2. 

Bibliographic  Remarks.  The  material  of  this  section  is 
based  on  [10].  The  problem  was  inspired  by  Wolkowisky's 
[62]  beautiful  analysis  of  v.  Kerman's  equations.  (Also, 
see  [19-21] . ) 


8 . Other  Problems 

Preliminary  evidence  indicates  that  the  axisymmetric 
buckling  of  nonlinearly  elastic  spherical  shells  and  caps 
under  hydrostatic  pressure  can  be  given  a global  analysis 
that  modifies  the  treatment  of  Section  7 in  much  the  same 
way  that  the  treatment  of  Section  5 modifies  that  of  Sec- 
tion 2.  Some  variational  aspects  of  this  problem  were 
briefly  noted  in  [2].  See  [18]  for  a fascinating  and  com- 
plete numerical  study  of  this  problem  in  the  context  of  v. 
Karman  theory. 

Among  the  most  interesting  remaining  buckling  problems 
for  nonlinearly  elastic  structures  are  three  which  are  gov- 
erned by  quasilinear  systems  of  partial  differential  equa- 
tions: (i)  the  buckling  of  a plate  of  any  shape  (e.g.,  rec- 

tangular) under  the  action  of  in-plane  edge  loads,  (ii)  the 
non-axisymmetric  buckling  of  spherical  shells  under 
hydrostatic  pressure,  and  (iii)  the  buckling  of  cylindrical 
shells  under  edge  loads  acting  parallel  to  the  generators. 
(Within  the  context  of  the  v.  Karman  theory.  Problem  (i) 
has  been  thoroughly  analyzed  in  [19,21]  by  means  of 
Lyusternik-Shnirel 'man  category  theory,  and  Problem  (iii) 
has  been  briefly  analyzed  in  [20].) 

Heretofore,  one  of  the  main  difficulties  obstructing 
the  analysis  of  geometrically  exact  theories  of  nonlinearly 
elastic  shells  has  been  the  choice  of  suitable  constitutive 
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restrictions:  The  strong  ellipticity  condition,  despite  a 

number  of  attractions,  has  proved  too  weak  for  analysis, 
the  analytic  virtues  of  Coleman-Noll  conditions  were  diffi- 
cult to  perceive,  while  the  monotonicity  condition,  which 
can  support  a relatively  simple  analysis,  also  ensures  the 
physically  unsuitable  uniqueness  of  solutions  to  problems 
in  which  the  loading  is  dead.  (In  three-dimensional  theor- 
ies, the  principle  of  frame-indifference  implies  that  semi- 
or  pseudo-monotone  constitutive  functions  must  be  monotone. 
Similar  results  apply  to  shell  theories.)  A felicitous 
resolution  of  these  difficulties  has  just  been  provided  by 
Ball  [17],  who  proposed  conditions,  somewhat  more  restrict- 
ed than  strong  ellipticity,  which  are  both  convenient  for 
analysis  and  physically  reasonable.  With  such  conditions, 
the  Problems  (i)-(iii)  seem  to  be  tractable  either  by  the 
methods  of  Rabinowitz  [50-52]  or  by  Lyusternik-Shnirel 'man 
category  theory. 
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Bifurcation  Theory  in  Non-Linear 
Hydrodynamic  Stability 

J.  T.  Stuart 


1 . Introduction 

Nonlinear  hydrodynamic  stability  theory  is  really  concerned, 
ultimately,  with  phenomena  such  as  transition  to  turbulence.  In  practice, 
however,  that  phenomenon  is  so  complex  as  to  defy  rational  understanding  at 
the  present  time.  A more  limited  objective  is  that  of  gaining  some 
understanding  of  nonlinear  processes  in  fluid  mechanics,  perhaps  with  reference 
to  the  early,  relatively-simple  stages  of  the  evolution  of  laminar  flow  to 
turbulence.  Even  then,  the  mathematical  problems  posed  are  challenging 
enough . 

Attention  has  therefore  been  focussed  in  the  literature  on  o number  of 
prototype  problems,  namely,  those  of  Taylor  vortices  between  concentric 
rotating  cylinders,  Mnard  cells  in  a convecting  fluid  layer  heated  from  below, 
Poiseuille  flow  between  parallel  planes  and,  quite  recently,  the  oscillatory 
Stokes'  layer.  In  this  lecture  an  attempt  will  be  made  to  describe  the 
experimental  observations,  the  mathematical  attempts  to  explain  them,  the 
results  obtained  and  important  questions  which  remain  open. 

It  is  appropriate  in  this  Introduction,  and  may  be  helpful,  to  say 
something  of  the  observations,  so  as  to  give  some  justification  for  the 
mathematical  structures  which  are  assumed  or  sought. 

(a)  Some  90-100  years  ago.  Lord  Rayleigh  and  others  were  interested  in 
the  problem  of  the  instability  of  a jet  in  a liquid.  In  experiment  a sinuous 
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wave  can  develop,  which  evolves  in  the  direction  of  flow  and  takes  on  a highly  1 

contorted  form.  The  theory  of  Rayleigh  considered  the  development  of  a j 

sinuous  normal  mode,  developing  from  and  propagating  in  a "parallel"  flow 

appropriate  to  a local  region;  both  viscosity  and  nonlinearity  were  neglected;  ^ 

but,  more  especially  from  the  present  viewpoint,  the  non-parallel , or 
developing  aspect  of  the  flow  was  neglected  also. 

(b)  During  the  present  century  a good  deal  of  interest  has  focussed  on  the  i 

aerodynamical  boundary  layer,  through  the  theoretical  work  in  Germany  of 

Prandtl , Heisenberg,  Tollmienand  Schlichting  in  the  1920sand  1930s, 

followed  by  experimental  observations  and  further  theoretical  developments  in  , 

the  United  States,  Europe,  Japan  and  elsewhere.  Again,  os  in  Rayleigh's 
work  on  instability  of  a jet,  attention  has  been  directed  towards  sinuous 

propagating  waves,  of  the  form  which  were  first  observed  by  Schubauer  and  < 

Skramstad  at  the  National  Bureau  of  Standards  in  Washington  in  1943. 

Theoretical  developments  have  been  based  on  the  " parallel -flow"  approximation, 
but  with  viscosity,  weak  nonlinearity  (for  small  amplitude  of  oscillation)  and 
flow  development  taken  into  account.  Three-dimensional  boundary-layers  have 
also  received  attention,  especially  in  connexion  with  swept-wings  and  with 
geophysical  Ekman  layers.  One  example  is  that  of  flow  due  to  a rotating  disk, 
where  a " logarithmic-spiral*  vortex  pattern  appears  in  the  boundary  layer  (see 
frontispiece  to  Rosenhead  (1963)).  In  this  case  theoretical  work  has  been  based 
on  linearized  equations  in  a "parallel-flow"  approximation. 

(c)  Taylor  vortices  occur  in  curved  flows,  and  especially  between 
concentric  rotating  cylinders  when  the  outer  is  at  rest  but  the  inner  one  is 
rotating.  The  cylinders  are  often,  but  not  always,  long  compared  with  the  gap 
between  them.  A parameter,  the  Taylor  number,  which  depends  upon  angular 
speed,  radius  of  curvature,  gap  width  and  kinematic  viscosity,  determines 
whether  the  toroidal  Taylor  vortices,  spaced  regularly  along  the  axis,  occur  or 
not.  They  do  occur  if  a critical  Taylor  number  is  exceeded,  and  they  have  a 
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definite  wavelength.  At  a higher  Taylor  number  still  the  vortices  become  time- 
dependent  and  wavy  and,  in  fact,  travel  around  the  annulus.  These  first  and 
second  transitions  are  followed  by  others  at  still  higher  Taylor  numbers. 

Variations  on  the  Taylor-vortex  theme  include  effects  of  non-concentric 
axes  of  the  cylinders,  when  the  critical  Taylor  number  is  known  to  depend  on 
the  eccentricity,  and  the  effects  o',  .ime-dependent  angular  velocity,  when  the 
critical  Taylor  number  depends  on  the  modulating  amplitude  and  frequency. 

Both  these  features  have  been  the  subject  of  theoretical  enquiry,  with  some 
degree  of  success . 

One  aspect  of  the  experimental  observations,  which  has  not  yet  been 
treated  adequately  by  theorists,  is  that  concerned  with  end  effects.  Many 
experiments  have  cylinders  whose  length  is  several  hundred  times  the  annular 
gap,  but  several  recent  observations  have  been  made  with  cylinders  only  some 
20  or  30  gap-widths  in  length.  It  is  clear  from  very  recent  work  of  Cole 
(1974a,b)  that  the  effects  of  the  ends  can  be  considerable  and  dramatic, 
especially  with  reference  to  the  development  of  wavy  vortices.  Both  the 
critical  Taylor  number  and  the  critical  axial  wave  number  for  the  second 
transition  are  strongly  influenced  by  the  end  boundary  conditions. 

(d)  B^nard  cells  can  occur  in  very  simple  experimental  configurations,  such 
as  a horizontal  layer  of  liquid.  The  lower,  rigid  boundary  is  heated,  while 
the  upper  boundary,  which  is  also  horizontal,  can  be  rigid  or  free  and  is 
cooled  relative  to  the  lower  boundary.  A cellular  convection  pattern  is 
possible  due  to  gravity  acting  on  the  density  variations,  and  is  characterized 
by  the  Rayleigh  number,  a parameter  which  depends  on  expansion  coefficient, 
temperature  gradient,  acceleration  due  to  gravity,  distance  between  the  planes, 
kinematic  viscosity  and  thermometric  conductivity.  Alternatively  we  note 
that  the  cellular  motion  can  also  be  driven  from  a free  surface  by  the  dependence 
of  surface  tension  on  viscosity. 
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In  many  of  the  problems  outlined  above,  the  occurrence  of  a critical 
parameter  (for  which  we  use  the  generic  term  "Reynolds*  number)  for  the 
linearized  perturbation  problem  implies  a bifurcation  for  the  nonlinear  equations. 
Such  bifurcations  have  frequently  been  calculated  for  idealized  cases  in  which 
the  geometry  extends  to  infinity  in  one  or  more  coordinates.  The  amplitude  of 
the  perturbation  is  zero  up  to  the  critical  Reynolds  number,  but  thereafter  is 
zero;  in  the  notation  of  Benjamin  (1976a)  the  supercritical,  subcritical  and 
transcritical  cases  of  bifurcation  are  illustrated  schematically  in  Fig.  1 (a),  (b),  (c) . 


Fig.l  . Amplitude  (A)  against 
Reynolds  number  (R): 

(a)  Supercritical; 

(b)  Subcritical; 

(c)  Transcritical. 


Such  problems,  which  lead  to  bifurcations  of  the  type  illustrated  in  Fig.l  are 
sometimes  known  as  " perfect"  , a term  prevalent  in  the  literature  of  the  theory 
of  elastic  stability.  If  small  "imperfections"  are  added,  such  as  that  of 
allowing  for  a large  but  finite  container,  the  problem  is  sometimes  known  as 
" imperfect"  . In  many  cases  there  is  no  strict  bifurcation:  rather,  the 
amplitude  follows  a smooth  path,  shown  dotted,  close  to  one  of  the  bifurcating 
amplitude  curves  of  Fig.l  , The  work  of  Matkowsky  and  Reiss  (1976),  as 
explained  by  Professor  Reiss  at  this  meeting,  deals  with  such  problems.  On 
the  other  hand,  there  are  known  examples  in  fluid  mechanics  where  an 
" imperfection"  (i)  allows  but  shifts  the  bifurcation  and  (ii)  possibly  changes 


NON-LINEAR  HYDRODYNAMIC  STABILITY 


131 


the  structure  between  Fig.  1(a),  (b)  or  (c)>  examples  include  eccentricity  of  the 
cylinders  in  the  Taylor-vortex  problem  (DiPrima  and  Stuart  1975)  and  viscosity 
variations  with  temperature  in  the  B6nard  problem  (Segel  and  Stuart  1962). 

2.  Equations  of  Motion 
These  are 


(2.D 


0V 

aT 


1 A 


if  = -V(H  + Iv?)  - Vcuri  f , 

p 2 - 


(2.2)  r = curl  v , 

(2.3)  div  v = O , 

representing  the  equations  of  momentum,  vorticity  (?)  and  continuity  for  an 
incompressible  fluid  of  constant  kinematic  viscosity  {V  ),  where  v is  the 
velocity,  p the  pressure,  t the  time  and  p the  density.  For  discussions  of  the 
B4nard  and  related  problems,  it  is  necessary  also  to  consider  the  "energy" 
equation  for  the  temperature  distribution  and  also  an  equation  of  state.  The 
set  of  equations  then  needs  to  be  solved  with  appropriate  boundary  and  initial 
conditions. 

3.  Prototype  problems 

In  order  to  explain  in  a relatively  simple  way  the  progress  which  has 
been  made,  we  shall  treat  four  prototype  problems  first  and  then  allow  for 
recent  and  important  variations  in  Section  4.  These  four  problems  are  (o) 
plane  Poiseuille  flow,  (b)  circular  Couette  flow  between  rotating  concentric 
cylinders,  (c)  B6nard  (gravitational)  convection  and  (d)  the  Stokes  layer, 

(a)  Plane  Poiseuille  flow  is  driven  between  two  parallel  planes  by  a uniform 

pressure  gradient  acting  in  the  x direction.  The  flow  is  unidirectional,  with  a 

2 + 
velocity  distribution  u(  y ) = 1 -y  " , which  is  zero  at  each  of  the  planes  y = - 1 . 

The  Reynolds  number  is  R = U h/17  , where  Uq  is  the  maximum  velocity  and 

2h  the  distance  between  the  planes.  Perturbations  proportional  to 
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exp  i(ax  + yz  - act)  , where  z is  the  transverse  or  spanwise  coordinate, 

must  satisfy  the  classical  Orr-Sommerfeld  equation.  If,  in  accordance  with 

one  convention,  we  insist  on  a and  y being  real,  then  the  imaginary  part  of  c 

(=c  + i c.)  yields  instability  if  c.  >0.  The  neutral  curve  defined  by  c.  = O 
r i ' ' i 

is  illustrated  in  Figure  2,  of  which  further  theoretical  details  can  be  found  in 

the  article  by  Stuart  (1  963).  Within  the  curve  , c.  > O , and  there  is  a 

temporal  instability.  The  critical  Reynolds  number  (R^_)  is  about  5772  with  a 

critical  wave  number  a = a =1  .02  at  y = O. 

c ' 

More  recently  Nishioka,  lida  and  Ichikawa  (1  975)  have  given 

substantial  experimental  confirmation  of  Figure  2 in  the  Reynolds  number  range 

6500-7500.  Furthermore  the  work  of  Reynolds  and  Potter  (1967),  Pekeris 

and  Shkoller  (1967),  Chen  and  Joseph  (1973)  and  Itoh  (1974),  based  on  the 

earlier  work  of  Stuart  (1960)  and  Watson  (1960),  indicates  the  bifurcation  at 

R = R , a = a to  be  of  the  subcritical  form  (Fig.  1(b)).  Consequently  there  is 
c c 

instability  possible  at  R < R^  if  a threshold  amplitude  is  exceeded.  An 
encouraging  comparison  between  theory  and  experiment  for  R = 5000  is  shown 
in  Fig. 3,  where  the  threshold  amplitude  of  Itoh’s  version  of  the  theory  is 
compared  with  the  observations  of  Nishioka  et  al.  This  experimental  work  is 
of  considerable  importance  in  giving  confidence  in  the  basic  theoretical  ideas. 


Fig. 2.  Neutral  curve  for  plane 
Poiseuille  flow. 


Fig, 3.  Threshold  amplitude  (Af) 
against  frequency  (ac^). 


L 
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(b)  Circular  Couette  flow  occurs  between  two  concentric  cylinders  of  infinite 

length  in  differential  rotation.  If  the  outer  cylinder  is  at  rest,  but  the  inner 

one  rotates,  perturbation  modes  of  the  form  exp((T  t)  cos  Xz  occur  if  the  Taylor 

number,  T,  exceeds  a critical  value  T , where  z is  the  axial  coordinate}  here 

c 

X,  the  wave  number,  and  <T  , the  growth  rate,  are  known  from  computations 

to  be  real.  From  linearized  theory  it  is  known  that  = O on  a neutral 

curve  shown  schematically  in  Fig. 4,  with  instability  ( > O)  above  the 

curve.  The  critical  Taylor  number  T is  about  1695  with  the  critical  wave 

number  X^_  about  3.13.  Many  calculations,  by  Stuart  (1958),  Davey(196?), 

Kirchgassner  and  Sorger  (1969)  show  that  the  bifurcation  at  T , X is  of 

c c 

supercritical  type  (Fig.  1(a)),  and  there  is  ample  confirmation  by  experiment 
within  the  limits  of  experimental  error. 


Fig. 4.  Neutral  curve  for  circular  Couette  flow. 


(c)  B4nard  (gravitational)  convection  occurs  between  two  horizontal  planes, 
when  the  lower  one  is  heated,  provided  that  the  Rayleigh  number,  Ra,  exceeds 
a critical  value  Ra  . A variety  of  plan  forms  is  possible.  The  neutral  curve 
<T  = O for  perturbation  cellular  modes  of  the  form  exp(  <T  t)  h(x,y),  where 
h is  periodic  in  x and  y,  is  of  the  form  of  Fig. 4,  with  Ra  replacing  T and  X 
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being  a representative  wove  number  within  h(x,y).  This  " perfect'"  problem, 
with  constant  viscosity,  is  of  supercritical  type  (Fig,  1(a))  in  its  bifurcation 
structure . 

(d)  The  Stokes  layer  occurs  on  an  infinite  flat  wall  which  oscillates  in  its 
own  plane  with  a frequency  dt)  rad/sec.  The  thickness  of  the  layer  is  of 

L 

order  ( V /Oi  )2  and  the  velocity  parallel  to  the  wall  (y  =0)  is  of  the  form 
exp(-y)cos(TT -y),  where  = A)  t.  Perturbations  of  the  form 
l|J(y,T)exp  ia(x-ct)  , with  a real  and  c complex,  satisfy  a partial 
differential  equation  closely  related  to  the  Orr-Sommerfeld  ordinary  equation. 
An  appropriate  Reynolds  number  defined  by  Hall  (1976)  is  R = UQ(2/yW  )2. 
Since  there  is  a semi-infinite  domain  of  flow  0<  y < ao,  there  is  a continuous 
spectrum  of  eigenvalues  c (with  c = O) , for  which  the  eigenfunctions 
t|J(y,  t ) °re  bounded  but  not  zero  as  y — > co.  In  addition,  for  certain  ranges 

of  Reynolds  number,  at  a given  wave  number,  there  is  a discrete  spectrum, 
with  c^  / O.  An  example  from  Hall's  paper  is  given,  illustratively  in  Fig, 5 
for  a = 0,15.  All  the  eigenvalues,  both  of  the  continuous  and  discrete 
varieties,  have  negative  values  for  c. , so  that  the  modes  are  damped. 
Moreover,  at  the  points  A,  8,  C,  D of  Figure  5 the  discrete  eigenvalues  and 
eigenfunctions  merge  into  the  continuous  spectrum,  with  eigenfunctions  which 
are  merely  bounded  at  infinity.  Hall1  s method  of  analysis  is  a combination 
of  Floquet  theory  with  numerical  analysis.  (An  earlier  analysis  of  Kerczek 
and  Davis(l  974)for  a Stokes'  layer  between  two  planes  gave  results  which  do 


not  reduce  to  the  discrete  modes  mentioned  above,  when  the  stationary  plane 
recedes  to  infinity.) 
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Fig. 5.  Discrete  spectrum  for  Stokes'  layer, 

A curved  Stokes  layer  is  developed  on  a circular  cylinder  of  infinite 

length,  which  oscillates  torsional ly  about  its  axis  with  frequency  CO  . The 

2 

Stokes  layer  is  thin  compared  with  the  radius  a if  W a / » 1 , as  is 

assumed.  This  flow  is  centrifugally  unstable,  in  a way  similar  to  circular 

Couette  flow,  and  Taylor-type  vortices  are  developed  provided  the  Taylor 

number  exceeds  a critical  value  (plate  2 of  Seminara  and  Hall  1976), 

Theoretical  work  by  the  same  authors  shows  that  the  Taylor  number 

2 A a '(&>/>'  )2,  where  A is  the  tangential  displacement  of  the  cylinder, 

varies  with  wave  number  for  a neutral  mode  in  the  way  illustrated  by  Fig, 4. 

Here,  however,  T is  164,  while  A isO„86>  experimental  observation  gave 
c c 

150  and  0.88  for  these  parameters.  In  a later  paper  Seminara  and  Hall  (1977) 
show  that  the  nonlinear  bifurcation  is  of  supercritical  type  Fig.  1 (a) . 

4.  Variations  on  the  prototype  problems 

In  this  section  we  consider  a number  of  modifications  to  the  prototype 
problems,  in  order  to  illustrate  other  aspects  of  the  bifurcation  problem. 

These  are  (a)  slow  variations  or  small  changes  leading  to  a shift  of  the  bifurcation 
Reynolds  number,  (b)  the  spectrum  of  modes  following  a bifurcation,  (c)  secondary 
bifurcations  and  (d)  "end"  and  other  changes  which  prevent  bifurcation 
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t 


occurring  and  replace  it  by  a smooth  amplitude  change  (the  so-called  " perturbed" 
bifurcation  theory). 

(а)  Slow  variations  (i)  eccentric  circular  cylinders 

DiPrima  and  Stuart  (1972,  1975)  have  studied  theoretically  the 
generation  of  Taylor  vortices  between  two  infinitely-long  cylinders  when  the 
axes  are  parallel  to  each  other  but  not  coincident.  In  this  problem  they  assume 
two  parameters  to  be  small,  namely  6,  the  ratio  of  gap  to  typical  radius,  and 
the  measure  of  the  displacement  of  the  axes.  Thus  both  the  rate  of  change 

(б)  of  the  flow  round  the  annulus,  and  the  overall  change  (£  ) are  assumed  to 

be  small.  By  assuming  that  6 and  £ are  related  by  6^  = 2K  £ for  the  case 

of  the  inner  cylinder  rotating  and  the  outer  one  at  rest,  with  £ — > O and 

K fixed,  the  authors  were  able  to  show  in  their  1972  paper  that  the  critical 

Taylor  number,  T , varies  with  £ in  the  way  shown  by  Fig. 6.  For  T > T 
c c 

Toylor  vortices  occur,  and  experimental  observations  are  in  substantial  agreement 
with  theory  for  £ <0.4,  with  reference  to  the  values  of  T predicted  by 
Fig. 6.  In  the  nonlinear  calculation,  DiPrima  and  Stuart  (1975)  showed  that 
the  bifurcation  is  supercritical  (Fig.  1(a))  but  shifted  in  T according  to  Fig. 6. 

The  Taylor-vortex  strength  is  predicted  to  have  its  maximum  value  at  an 
azimuthal  angle  of  about  50°  from  the  place  of  maximum  gap,  a result  in 
accordance  with  experiment. 


Fig. 6. 


Critical  Taylor  number  (T  ) against  eccentricity 

(£). 
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(a)  Slow  variations  (ii)  Blasius  flow 

In  contrast  to  the  scheme  of  approximation  mentioned  above,  slow 
variations  in  boundary  layers  are  usually  associated  with  an  0(1)  change  in  the 
flow  itself.  Thus  if  in  (i)  it  were  decided  to  study  the  problem  with  £ of 
order  1 , a different  scheme  of  approximation  would  be  needed.  Such  a method 
is  the  W.K.B.  J.  scheme,  which  has  been  applied  recently  by  Bouthier  (1973), 
Nayfeh  (1974),  Gaster  (1974)  and  Eagles  and  Weissman  (1975)  to  the  problem 
of  the  stability  of  the  developing  Blasius  and  other  boundary  layers,  with 
linearized  perturbations.  Comparison  between  theory  and  experiment  in  the 
sense  of  a diagram  like  Fig. 2 is  improved  by  including  flow  divergence,  and  a 
bifurcation  still  arises  (Gaster  1974).  However,  as  Gaster  has  rightly  pointed 
out,  the  amplication  rate  in  the  x coordinate  depends  on  the  property  being 
considered.  This  is  important  for  the  comparison  with  experiment. 

(a)  Slow  variations  (iii)  Convection 

Many  authors  (see,  for  example,  Segel  and  Stuart  1962)  have  shown 

that  the  variation  of  viscosity  with  temperature  can  be  important  near  to  the 

critical  Rayleigh  number  for  convection  in  a layer  of  fluid  heated  from  below. 

For  example,  in  the  particular  case  of  convection  in  hexagonal  cells  a 

bifurcation  point  exists,  but  is  changed  from  the  supercritical  case  (Fig.  1(a)) 

to  the  Transcritical  Form  (Fig.  1(c)).  Thus  instability  is  possible  at  finite 

amplitude  for  Rayleigh  numbers  below  Ra  . 

c 

(b)  The  Spectrum  of  modes  following  bifurcation 

As  can  be  seen  from  Figs. 2 and  4 for  the  particular  examples  of  plane 
Poiseuille  flow  and  circular  Couette  flow,  once  the  critical  point  is  exceeded 
(R  or  T ) a band  of  wave  numbers  is  possible  for  which  exponential  growth 
takes  place  at  small  amplitudes.  At  larger  amplitudes,  for  which  the  nonlinear 
terms  are  important,  there  is  a competition  between  modes  of  different  wave 
numbers.  This  competition  has  been  described  in  various  ways,  but  perhaps 
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most  successfully  in  terms  of  the  amplitude  or  evolution  equation  introduced  in  a 
general  way  by  DiPrima,  Eckhaus and  Segel  (1971),  Newell  and  Whitehead 
(1971)  and  Stewartson  and  Stuart  (1971), 

This  topic  is  perhaps  best  discussed  in  terms  of  the  case  of  plane 
Poiseuille  flow.  Suppose  we  consider  a perturbation  wave  of  the  form 
exp  ia^x-c^t)  A(  ? , •*),  where  the  suffix  c denotes  the  critical  point 
(V  Rc)  Fig. 2,  and  the  suffix  r denotes  the  real  part.  Moreover 

'V*'  2 t 2 

u L t and  S ~ £ (x-c  t),  where  £ = a c.  is  the  growth  rate 

9 c i 

at  a Reynolds  number  slightly  above  R^,  with  wave  number  a and  group 
velocity  c . If  £ is  sufficiently  small  it  can  be  shown  that  the  Amplitude 
function  A(  % , ~C)  satisfies  the  equation 

9 

/a  i \ 0A  0 A 7 

(4.1)  ~ -a?_  = <rA-kAIAI2, 

3 * 

where  <r*c  is  a real  number  and  a^  and  k are  in  general  complex  numbers. 

Several  phenomena  of  interest  can  be  discussed  in  the  context  of  this  parabolic 

differential  equation:  (i)ifk  >0,  <T  >0  then  an  equilibrium  solution 
9 r -1  c 

of  (4.1)  exists  with  IA|  = <r ^ k^  . Whether  it  is  stable  or  not  depends 
on  the  Eckhaus-Benjamin-Feir  instability  mechanism,  according  to  the  value 
of  a^  and  the  complex  numbers  a?  and  k (Newell  and  Whitehead  1969, 

Stuart  and  DiPrima  1976),  (ii)  If  k^  < O and  for  certain  ranges  of  the 
parameters  a^  and  k,  a focussed  bursting  phenomenon  can  occur,  in  which 
the  amplitude  grows  to  large  values,  outside  the  range  of  validity  of  (4.1),  but 
in  a finite  time  (Hocking  and  Stewartson  1972). 

In  the  case  of  circular  Couette  flow  and  B4nard  convection,  equation 
(4.1)  is  valid  but  with  a^  and  k both  real.  Work  on  the  question  of  the 
stability,  and  of  the  preferred  mode,  has  been  done  by  Kogelman  and  DiPrima 
(1970)  and  by  Busse  (see,  e.g..  Clever  and  Busse  1974),  though  the  work  of 
the  latter  is  not  based  on  (4.1), 
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(c)  Secondory  bifurcations 

One  of  the  most  interesting  examples  of  such  phenomena  in  fluid  flow  is 
that  of  the  development  of  wavy  vortices  between  rotating  circular  cylinders. 

It  is  known  from  observations  (Coles  1965)  that  the  Taylor-vortex  flow  is  replaced 
as  a stable  form  of  flow  by  a doubly-periodic  flow,  periodic  in  both  the 
azimuthal  and  axial  coordinates  and  in  time.  Calculations  of  instability  by 
Davey,  DiPrima  and  Stuart  (1968)  and  by  Eagles  (1971)  have  given  a value  for 
the  Taylor  number,  at  the  secondary  bifurcation  point,  in  reasonable  agreement 
with  experiment.  In  a later  calculation,  of  the  torque  on  the  inner  cylinder. 
Eagles  (1974)  showed  that  this  is  lower  than  it  would  be  if  the  flow  had 
remained  of  Taylor-vortex  form,  as  shown  in  Figure  7.  Additional  work  on  this 
problem  has  been  done  by  Nakaya  (1975)  and  Weinstein  (1977). 


Fig. 7.  Torque  against  Taylor  number  (T) 

(a)  Laminar  flow,  (b)  Taylor-vortex  flow, 

(c)  wavy-vortex  flow*  dotted  lines  unstable  . 


Further  work  is  needed  on  this  problem  in  the  neighbourhood  of  the  secondary 
bifurcation  point:  Eagles  (1974)  found  that  the  best  fit  of  torque  with 
experiment  was  obtained  by  assuming  4 waves  in  the  azimuth,  as  in  Coles 
observations.  No  calculation  of  nonlinear  wave  interactions  has  been  made 
to  justify  this  assumption. 
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(d)  Effects  of  fi  nite  geometry 

It  has  been  known  for  some  time  that,  in  experiments  on  instabilities  of 
fluid  flows,  the  lateral  boundaries  (which,  in  theoretical  work,  are  assumed  to  i 

be  at  infinity)  can  have  a considerable  influence  on  the  development  of 
transitions,  or  bifurcations,  from  one  state  of  flow  to  another.  For  example, 
several  authors  have  commented  on  the  occurrence  of  'ghostly"  Taylor 

I 

vortices  between  concentric  cylinders  of  finite  length  at  speeds  well  below  the 
critical  Taylor  number  calculated  from  linearized  theory  for  cylinders  of 
infinite  length  (Cole  1974a, b,  Jackson,  Robati  and  Mobbs  1975,  Benjamin 

1976b).  In  addition.  Cole  (1976)  has  shown  that,  whereas  the  critical  1 

Taylor  number  for  Taylor  vortices  (first  bifurcation)  is  little  affected  by  the 

length  of  the  cylinders,  the  critical  Taylor  number  for  the  occurrence  of  wavy 

vortices  (secondary  bifurcation)  is  strongly  dependent  on  the  length  when  this 

is  less  than  about  ten  times  the  gap  between  the  two  cylinders  (Figure  8). 

No  theoretical  explanation  has  yet  been  given  for  this  latter  phenomenon. 


Fig. 8.  Critical  Taylor  number  against  length  (L)  of  cylinders 
for  (a)  Taylor  vortices  and  (b)  wavy  vortices. 

We  return  now  to  the  matter  of  the  "ghostly"  incipient  vortices,  which 
are  known  to  occur  at  low  speeds.  In  various,  though  limited,  ways  this 
phenomenon  has  been  the  subject  of  theoretical  investigations  by  Joseph  (1971), 
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Benjamin  (T976a,b),  Kelly  and  Pal  (1976),  Daniels  (1976)  and  Hall  and  Walton 
(1976).  A summarized  view  of  the  present  position  can  best  be  given  by 
reference  to  Fig.  1(a)  and  to  Figure  9 which  is  given  below.  In  the  case  of 
cylinders  of  infinite  length  a bifurcation  of  supercritical  form  occurs  (Fig.  1(a)), 
from  which  stable  Taylor  vortices  occur.  When  the  cylinders  are  of  finite 
length,  however,  it  is  believed  (though  is  in  no  sense  yet  proven)  that  a smooth 
transition  takes  place,  as  shown  in  the  amplitude  diagram  of  Figure  9.  This 
view  is  in  accordance  with  long  established  work  in  elastic  stability  theory, 
dating  from  W.T.  Koiter's  thesis  at  Delft  in  1945  (see  also  E.L.  Riess's 
contribution  in  this  volume,  and  Matkowsky  and  Riess  1976). 


Thus,  the  first  (Taylor-vortex)  bifurcation  is  replaced  by  a smooth  change,  so 
that  Taylor  vortices  are  possible  at  all  speeds,  albeit  only  weakly  at  low  speeds. 
We  note,  furthermore,  the  observation  of  Cole  (1974a,b)  that  the  Taylor 
vortices  occur  first  at  the  ends  and  then  spread  inwards  towards  the  middle 
region.  In  Benjamin's  apparatus  (1976b)  only  two  vortices  occur  at  lower 
speeds,  starting  from  rest,  and  this  continues  for  a range  of  speeds  until  a true 
(secondary)  bifurcation  occurs  to  a flow  with  four  vortices.  Further  bifurcations 
occur  at  still  higher  speeds,  and  are  described  by  Benjamin  in  some  detail. 
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From  a theoretical  point  of  view  not  a yreat  deal  can  yet  be  said  with 
certainty  about  instability  of  flows  in  a bounded  domain.  In  1971  Joseph 
discussed  buoyancy-driven  convection  in  a container  with  some  arbitrariness  in 
geometry,  but  with  the  crucial  condition  that  the  boundary  conditions  on  the 
container  permit  a simple  solution  for  the  temperature,  with  no  convection. 
Joseph  then  shows,  by  "formal"  arguments,  that  the  bifurcation  is  of  the 
" transcritical"  type  (Fig. 1c)  with  effects  such  as  (a)  viscosity  variation  with 
temperature  and  (b)  internal  heating.  However,  when  the  two  latter  effects 
are  zero  the  bifurcation  is  supercritical  (Fig. la).  Benjamin  (1976a, b)  seems 
to  have  misread  Joseph' s paper  when  he  says  (1976a,  p.386)  that  " Joseph  has 
shown  that  the  effect  of  lateral  boundaries  is  to  change  the  bifurcation  from  the 
supercritical  to  the  transcritical  form"  ; conditions  of  symmetry  (as  in  Joseph's 
problem)  can  ensure  that  the  supercritical  form  is  preserved,  even  with  lateral 
boundaries  present. 

The  situation  is  more  complex  still  when  the  boundary  conditions  do  not 
permit  a simple  solution.  Such  a situation  appears  to  be  the  case  especially  in 
the  experimental  situation  of  Taylor  vortices  described  by  Benjamin  (1976b). 

In  his  earlier  paper  (1976a),  Benjamin  has  shown  that,  when  a bifurcation  takes 
place  from  a basic  solution  (which  will  itself  include  "vortices"  in  general), 
the  bifurcation  is  transcritical.  Schauder-Leray  degree  theory  is  applied  with 
two  assumptions:  (a)  that  the  eigenvalue  is  simple;  (b)  that  a certain  non- 
linear expression  (a)  is  non  zero.  The  latter  assumption  is  tantamount  to 
assuming  that  the  bifurcation  is  transcritical  rather  than  supercritical.  When 
symmetry  dictates  that  a = O , however,  the  bifurcation  will  be  supercritical , 
as  in  Joseph's  example  quoted  above.  It  seems  that  Benjamin's  view,  that 
bifurcations  in  closed  domains  are  normally  transcritical,  overstates  the  case; 
supercritical  bifurcations  can  occur. 
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We  now  turn  to  the  matter  of  the  calculation  (by  so-called  “formal" 
means)  of  the  basic  solution  when  a simple  solution  (without  vortices)  cannot  be 
obtained.  This  has  not  been  done  for  the  Taylor-vortex  problem.  Calculations 
have,  however,  been  made  by  Daniels  (1976)  and  by  Hall  and  Walton  (1976) 
for  convection  in  a two-dimensional  box  with  the  Rayleigh  (free  surface  and 
given  temperature)  conditions  on  the  top  and  bottom  horizontal  surfaces  of  the 
box,  but  with  zero  velocity  and  a specified  normal  temperature  gradient  on  each 
of  the  vertical  end  walls.  These  latter  conditions  simulate,  in  a mathematical 
way,  the  ends  of  the  cylinders  in  a Taylor  vortex  experiment.  Daniels'  work, 
which  considers  the  case  when  the  distance  between  the  end  walls,  L,  is  very 
large  compared  with  the  depth,  d,  of  the  layer,  stems  from  a paper  of  Segel 
(1969)  on  the  related  problem  with  homogeneous  conditions  on  the  side  walls. 

In  Segel 1 s problem  a simple  solution  (with  no  convection  vortices)  is  possible, 


and  is  shown  by  Segel  to  lead  to  a bifurcation  at  a Rayleigh  number  of  order 

higher  than  that  for  infinite  planes  (no  end  walls).  Segel1  s method  was 
that  of  the  amplitude  equation  and  this  is  followed  by  Daniels.  The  non- 
homogeneous  conditions  at  the  end  walls  force  the  development  of  a basic 
solution  with  convection  cells  for  the  whole  range  of  Rayleigh  numbers  from 


zero.  The  amplitude  of  the  convection  velocity  then  follows  the  form  shown  in 


Fig. 9,  smoothly  by-passing  the  bifurcating  solution  for  non -homogeneous  end 

conditions.  In  Daniels'  work  the  deviation  of  the  Rayleigh  number  from  the 

2 

critical  value  is  limited  to  be  of  order  (d/L)‘,  while  the  normal  temperature 
gradient  on  the  end  walls  is  of  order  6 = XdL  * , with  X constant. 

In  the  companion  and  independent,  paper  by  Hall  and  Walton  (1976), 
the  ratio  L/d  is  not  necessarily  large,  and  the  authors  study  the  problem  defined 
in  the  previous  paragraph,  by  reference  to  the  work  of  Drazin  (1975)  on  the 
linear  eigenvalue  problem  for  homogeneous  end-wall  conditions.  When  the 
normal  temperature  gradient  on  the  end  walls  is  small  and  of  order  6,  the 
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solution  in  the  interior  is  forced  by  those  conditions,  and  convection  develops 

even  at  a low  Rayleigh  number.  More  particularly  if  the  normal  temperature 

gradient  on  each  of  the  end  wells,  as  a function  of  height,  is  of  the  form  of  an 

eigenfunction  of  the  linear  problem,  the  solution  will  become  unbounded  if  the 

Rayleigh  number  is  at  its  critical,  or  bi  furcating , value.  In  this  neighbourhood 

it  is  necessary  to  introduce  the  nonlinear  terms  in  the  differential  equations, 

and  to  effect  this  the  authors  follow  an  argument  introduced  by  Kelly  and  Pal 

(1976)  in  an  important  pioneering  contribution  on  a related  problem,  that  of 

convection  in  an  infinite  layer,  heated  from  below  with  a spatially-varying 

temperature  gradient.  This  reasoning  suggests  that,  in  the  neighbourhood  of 

the  critical  Rayleigh  number  (Ra  ) of  the  Hall-Walton  problem,  the  convection 

c 

amplitude  A,  the  Rayleigh  number  Ra  and  the  magnitude  6 of  the  normal 
temperature  gradient  at  the  end  walls  are  related  by 

(4.2)  A3  = A(Ra-Ra  ) + 6 

c 

A multiple-scaling  argument  confirms  this  suggestion.  We  note,  moreover, 

2 

that  if  6 = O,  the  amplitude  bifurcates  at  Ra=Racfrom  A = O to  A = (lb  -Rac), 

as  in  Fig. 9.  On  the  other  hand,  if  A3  is  ignored  we  have  A = -6/(Ra  - Ra^), 

which  is  singular  at  (Ra -Ra  ),  due  to  resonance  in  the  linearized  solution. 

c 

Although  Hal  I and  Walton1  s analysis  has  no  restriction  on  L/d , a more 

stringent  restriction  is  necessary  for  the  Ra  as  d/L  — > ao  than  in  Daniels'  s 

2 

work,  namely  that  Ra-Ra  «0(d/L)‘.  Hall  and  Walton  also  analyse  cases 
when  the  normal  temperature  gradient  on  the  side  walls  is  not  of  the  form  of  an 
eigenfunction  j in  such  cases  a true  bifurcation  can  occur,  of  supercritical 
or  transcritical  type  (Figures  la,  lc). 

It  is  clear  that  much  work  remains  to  be  done  on  the  calculation  of 
Taylor-vortex  and  other  convective  flows,  especially  to  bridge  the  gap 
between  abstract  theory  on  the  one  hand  and  experimental  observation  on  the 
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other.  The  work  of  Kelly  and  Pal,  of  Daniels  and  Hall  and  Walton  constitutes 
a series  of  important  first  attempts  in  this  area.  Extensions  to  convection  with 
more  realistic  boundary  conditions,  and  to  the  Taylor-vortex  problem,  are 
awaited  with  interest. 
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Preference  in  Pattern  and  Cellular 
Bifurcation  in  Fluid  Dynamics 

Klims  Kirvhgassner 


1 . Introduction 

The  question,  why  certain  motion  patterns  are  selected  in 
some  models  of  fluid  dynamics  which  exhibit  cellular  in- 
stability, has  been  in  the  background  of  every  bifurcation 
analysis  in  this  field.  For  twenty  years  numerous  theo- 
rectical  and  experimental  contributions  with  varying  physi- 
cal insight  or  mathematical  rigour  have  appeared,  yet  our 
knowledge  is  still  fragmentary.  This  survey  is  devoted  ex- 
plicitly to  this  problem.  However,  since  the  excellent 
monograph  of  Joseph  [14]  which  covers  the  state  of  affairs 
completely  has  appeared  recently,  we  shall  raise  the  ques- 
tion from  a new  point  of  view. 

The  celebrated  Taylor  - resp.  Benard  models  in  fluid  dynam- 
ics are  described  by  boundary  value  problems  which  are  in- 
variant under  axial  translations  resp.  the  rigid  motions  of 
the  plane. The  invariance  property  causes  a highly  degen- 
erate situation  and  solutions  of  different  invariance  struc- 
ture are  possible.  Earlier  theoretical  work  in  this  area 
has  exclusively  considered  motions  which  are  periodic  or 
doubly  periodic  with  given  and  fixed  periods.  An  exception 
are  the  important  contributions  of  Busse,  but  there  other 
difficulties  arise  (see  section  4) . Recently,  in  a series 
of  papers,  Sattinger  has  given  a systematic  treatment  of 

Naturally,  for  this  to  be  true,  effects  of  finite  bound- 
aries have  to  be  neglected,  an  assumption  which  is  not  un- 
disputed . 
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bifurcation  and  pattern  selection  among  doubly  periodic  so- 
lutions using  group  theoretical  methods. 

In  this  contribution  we  shall  raise  the  question  whether 
the  assumption  of  periodicity  is  reasonable  or  not,  give 
some  answers,  substantiate  some  conjectures,  state  some  open 
problems,  and  survey  occasionally  related  results.  To  avoid 
technical  difficulties  and  for  the  sake  of  a better  insight 
we  have  chosen  a sequence  of  model  equations  of  increasing 
degree  of  complexity.  Ad  hoc  methods  which  by  their  very 
nature  are  confined  to  the  model  equations  - such  as  maxi- 
mum principle,  sub-  and  supersolutions  - are  sometimes  used 
but  their  limited  range  of  applicability  is  emphasized. 

First  we  treat  an  easy  one-dimensional  problem  where  the 
reader  can  check  the  results  by  using  phase  plane  methods. 
However,  we  phrase  our  results  in  a way  which  opens  itself 
to  generalisations.  It  is  shown  that  almost  all  "small" 
solutions  are  periodic,  and  that,  if  a certain  geometric 
condition  is  met,  singular  solutions  (constant  or  non- 
periodic) exist  as  limits  of  periodic  solutions  with  large 
periods.  For  every  positive  value  of  the  parameter  the  set 
of  solutions  form  a full  neighborhood  of  0.  Nevertheless, 
also  in  this  case,  the  selection  of  a definit  solution  de- 
pends on  the  geometry  of  the  bifurcation.  At  the  end  of 
section  2 we  shall  explain  the  phenomenon  for  some  special 
nonlinearities. 

Only  the  two-dimensional  models  in  the  sections  3 and  4 
provide  a realistic  analogy  to  the  Taylor-  and  the  Benard 
problem.  Since  for  these  problems,  to  every  supercritical 
A the  linearisation  has  solutions  with  at  least  two  differ- 
ent periods  (17],  we  expect  quasiperiodic  solutions  to 

occur.  In  section  3 we  show  that  the  A-range  where  all 

. 2 2 

solutions  are  periodic  is  confined  to  the  interval  ( it  , 4 tt  ). 
2 

If  A > 4 tt  holds  then,  if  we  accept  formal  power  series 

as  solutions,  all  small  solutions  are  quasiperiodic.  Again 
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stability  is  studied  for  special  nonlinearities  and  conclu- 
sions are  drawn  for  the  Taylor  problem. 

In  section  4 we  briefly  discuss  the  model  equation  in  the 
plane.  The  uniqueness  result  which  is  the  key  to  the 
characterisation  of  all  solutions  in  the  earlier  sections, 
is  not  known  in  this  case.  Therefore,  we  can  only  describe 
a formal  method  consistent  with  the  equations  yielding 
quasiperiodic  solutions.  However,  as  is  seen  by  comparison 
with  the  linearized  equation,  the  method  is  far  from  giving 
all  possible  solutions. 

Whereas  our  method  consists  in  a series  expansion  of  the 
solution  and  its  basic  frequencies  in  terms  of  the  ampli- 
tudes of  different  modes  for  fixed  X,  Busse's  method  [2] 
works  with  fixed  frequencies  and  variable  parameter  X and 
thus  is  confined  to  socalled  semiregular  solutions.  The 
methods  are  compared  and  stability  is  studied.  In  particu- 
lar it  is  shown  that,  for  the  model  equation,  a unique 
asymptotically  stable  solution  is  selected  for  a special 
nonlinearity . 
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2 . The  model  equation  in  dimension  one 

In  this  section  we  briefly  discuss  the  simplest  possible 
model  in  one  dimension,  where  the  method  of  phase  plane  pro- 
vides a full  picture  of  all  solutions.  However,  we  leave 
this  instrument  to  the  use  of  the  reader  to  check  the  valid- 
ity of  the  results,  and  we  present  a procedure  which  is 
applicable  in  higher  dimensions  as  well.  The  main  conclu- 
sion is  that,  under  certain  symmetry  conditions  for  the  non- 
linearity, essentially  all  small  solutions  are  periodic. 
Although  obtained  by  general  methods,  its  significance  for 
higher  dimensions  - and  especially  for  the  hydrodynamical 
case  - is  rather  limited,  as  the  comoarison  with  later 
sections  will  show.  A generally  valid  feature  is  the  exis- 
tence of  "singular"  solutions,  if  a certain  geometrical 
bifurcation  property  is  met. 

As  for  simple  bifurcation  problems  stability  and  instability 
is  intimately  connected  with  the  geometry  of  the  bifurcation 
picture  [7];  this  will  be  shown  for  some  special  cases  using 
the  method  of  super-  and  subsolutions,  which,  however,  is 
confined  to  second  order  problems  exclusively. 

Define  the  differential  expressions 
L ( A ) u = u + Au 

yy 

N j (u, A ) = L ( A ) u - f(u,Uy)  - jufc,  j = 0,1 

where  subscripts  indicate  partial  differentiation,  and 

2 2 

where  A is  a real  parameter.  The  function  f e C ( 3R  , 1R  ) 

is  supposed  to  be  o(u),  i.e.  f(0)  = f (0)  = f (0)  = 0 . 

2 — ^ P 

We  study  the  solutions  u e C ( H , K ) of  the  equation 
(2.1)  Nq(u,A)  = 0 , 

in  particular  the  qualitative  characterisation  of  all  small 
solutions . 
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If  f = 0 holds  then,  for  A < 0,  all  nontrivial  solutions 
grow  exponentially,  for  A = 0 they  grow  linearly,  whereas 
for  A > 0 all  solutions  are  periodic.  Restricting  atten- 
tion to  solutions  of  polynomial  growth  the  question  arises 
whether,  for  A > 0,  the  qualitative  picture  is  maintained 
under  nonlinear  perturbations.  This  is  not  true  in  general, 
as  f(u,p)  = p3  shows.  Before  stating  additional  conditions 
on  f,  we  formulate  a uniqueness  result  which  is  trivial 
for  this  simple  case. 


Define 


’u(0)cos  /X  y + — u (0)  sin/X  y,  if  A > 0 
/A  y 


Pu  = 


u (0)  + yuy (0) 


, if  A = 0 


then,  for  every  u e ker  L(A)  there  is  at  most  one  solution 
of  (2.1)  with  Pu  = u . 


Now  we  introduce  the  following  conditions: 
(a)  f(u,-p)  = f (u,p) 


(2.2) 


(b)  f ( u , — p ) = — f(u,p)  , f ( — u , — p ) = f (u , p) 


Proposition  2.1.  If  the  function  f satisfies  the  condi- 
tions (2.2)  a)  or  b) , then,  to  every  A > 0,  there  is  a 
constant  c(A)  > 0,  such  that  all  solutions  u of^  (2.1) 
with  | Pu|  < c ( A)  are  periodic. 

The  proof  will  be  given  in  the  next  section.  It  is  of  in- 
terest to  notice  that  the  existence  of  "singular"  solutions 
is  tied  to  the  geometry  of  the  bifurcation  picture;  e.g. 
if  f(u,p)  = -u3  all  solutions  are  periodic,  no  singular 
solution  exists.  Every  point  A > 0,  u = 0 is  a bifurca- 
tion point  of  periodic  solutions  and,  if  attention  is  con- 
fined to  a fixed  period,  we  have  bifurcation  to  the  left.  It 
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turns  out  that  the  essential  feature  is  the  fact  that  at 
X = 0 small  nontrivial  solutions  exist. 

R 

To  be  specific:  H (Q)  denotes  the  space  of  functions  hav- 
ing distributional,  square  integrable  derivatives  up  to  the 

k k 

order  k in  Q,  H1qc  = {u  € H (K) | K compact  in  fi}  . Set 
ua(y)  = u(y+a)  and 

E (U)  = SUP  ||  U0  ||  y 

Of®  H (-1,1) 

For  fixed  X,  the  component  of  connectedness  in  Hr  (R) 

loc 

whose  closure  contains  u = 0 is  denoted  by  r 

A 

Proposition  2.2.  Suppose  that  f satisfies  (2.2)  a)  orb). 
Assume  that  for  every  e > 0 there  is  a positive  6 such 
that  for  all  X'e  (0,  6) 

sup  E(u)  < c 

UePX 

holds.  Then,  for  sufficiently  small  X > 0,  there  is  an 

open  and  bounded  neighborhood  U(X)  of^  0 in  ker  L(X) 

such  that,  for  every  u e closure  U(X),  a solution  u of 

(2.1)  exists  with  Pu  = u . For  u e U(X)  u is  periodic, 

if  u « 9U(X)  this  u is  singular  (either  constant  or  non- 

periodic) . In  the  latter  case  u is  the  H?  (R) -limit  of 

— loc  

periodic  solutions  having  arbitrarily  large  irreducible  per- 
iods . 

Typical  examples  are  u , u , with  X resp.  +/T  and  exact- 
ly one  resp.  two  nonconstant  singular  solutions  (up  to  trans- 
lations) . The  case  f(u,p)  = up  provides  an  example  where 
the  condition  for  E(u)  is  violated  and  where  no  bounded 
singular  solution  exists.  For  the  proof  of  proposition  2.2 
see  [18].  Similar  arguments  for  the  existence  of  solitary 
waves  have  been  used  in  [1]. 
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; 

Stability  is  understood  in  the  sense  of  Liapunov  with  re- 
spect to  solutions  of  the  parabolic  equation 

(2.3)  N ( u , A ) =0.  u I = u 

1 1 t=0 


As  measure  of  closeness  we  use  the  norm  ||u||  = sup|u(y)  | . 

3R 

Accordingly,  u is  called  stable  if,  for  every  e > 0 
there  is  a positive  6 such  that  ||  u ( * , t ) 1 1 < e if 

II  u ( • , 0 ) ||  < 6 for  all  solutions  u of  (2.3);  furthermore, 

u is  called  asymptotically  stable,  if  lim  ||  u ( • , t ) II  = 0 . 

t '►00 

A solution  u of  (2.3)  is  called  unstable  if  it  is  not 
stable.  Note,  that  the  proof  of  instability  requires  the 
existence  of  a solution  of  (2.3)  of  arbitrary  small  norm  at 
t = 0 leaving  eventually  a certain  neighborhood  of  u . We 
shall  not  dwell  on  this  point  since,  by  the  use  of  the  esti- 
mates in  [20],  p.  273,  existence  can  be  shown  easily. 

The  special  structure  of  (2.3)  can  be  used  to  give  detailed 
information  about  the  stability  behavior. 

However,  since  the  methods  (maximum  principle,  differential 
inequalities)  restrict  the  range  of  possible  applications 
to  second  order  problems,  we  confine  ourselves  to  some  spec- 
ial situations  exhibiting  typical  phenomena.  We  make  use  of 
the  following 

Proposition  2.3.  Consider  sufficiently  smooth  functions 
Cl , u which,  together  with  their  derivatives  are  bounded  in 
]R  X [0,T]  . Suppose 

N^(u,A)  < 0 and  N2(u,A)  > 0 

A i V . 

u > u 

1 t=0  “ ' t=0 

then  u > u in  1R  X [0,T]  . 
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The  proof  is  a consequence  of  Theorem  10,  [22],  p.  183.  Ob- 

viously 


f (u,uy) 


V V 

f (U,u  ) 


, A V .A 

c (u  - u)  + d (u 


with  bounded  coefficients  c and  d . The  growth  condition 
is  fulfilled  in  view  of  the  boundedness  of  u,  u and  their 
derivatives . 


We  formulate  the  stability  results  for  nonlinearities  f de- 
pending on  u only.  They  remain  true  locally  near  0 for 
more  general  f,  if  the  leading  term  is  independent  of  p . 

Proposition  2,4 

a)  Let  be  solutions  of  (2.1)  such  that  uQ(y)  > u1  (y) 

for  at  least  one  y . Assume  f to  be  convex  near  u^  . 
Then  u^  is  not  asymptotically  stable. 

k>)  If,  in  addition,  (Uq-u^  (y)  > a > 0 for  all  y e 1R  and 
if  f is  strictly  convex,  then,  u is  unstable . 

c)  Let  be  uf'(u)  - f(u)  > 0 for  u > 0.  Then,  every  pos- 
itive solutions  of  (2.1)  is  stable.  If  the  same  inequali- 
ty holds  for  u < 0,  every  negative  solutions  is  stable. 

d)  If,  in  addition,  uQf ' - f(uQ)  > Y > 0 holds  for 
some  positive  (negative)  solution  uQ  of  (2.1)  then , 
u0  is  asymptotically  stable. 

For  the  proof  of  a)  use  proposition  2.3  with 

V 

U z £U0  + (l-e)u^,  e e (0,1),  and  u solution  of  (2.2) 
with  u|t_Q  _ u • Assertion  b)  follows  via  the  same  an- 
satz , if  we  set 


r0e 


Tt 


aB. 

~rt 

1 + eQ(e  ^ -1) 


£(t) 


0 < B < f"  (u)  . 
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To  show  c)  , use  u = c^u^,  u = c2uo'  0 < c2  < ^ < ci  anc^ 
remark  that  f(u)/u  is  increasing;  a similar  argument  for 
Uq  negative.  For  the  proof  of  d)  use  the  same  ansatz  and 
set  (a  £ uq  = 

0 

C1 

C.  (t)  - q Q — 2 , 

c^~ (c^-1 ) exp (-ayt/M  ) 


T* 

M 


c2(t) 


l-c^+c^exp (a yt/M  ) 


< 1 < c 


We  conclude  that,  if  X < 0 and  f(u)  = u , k even,  all 

periodic  solutions  are  not  asymptotically  stable,  since  they 

are  bounded  above  by  u_  = X^^  ^ . Moreover,  for  X > 0 

2 u 

and  f(u)  = u all  periodic  solutions  are  unstable  (accord- 
ing to  b) ) , the  nonconstant  singular  solution  is  not  asymp- 
totically stable  (see  a) ) and  the  constant  solution  un  = X 

k u 

is  asymptotically  stable  (see  d) ) . For  f(u)  = u , k odd, 
and  X > 0,  the  constant  solutions  ±X^'  ^ ^ are  asymp- 
totically stable. 

To  prove  that  all  periodic  solutions  are  not  asymptotically 
stable  in  this  case  one  can  use  a local  version  of  prooosi- 
tion  2.1  (it  is  a special  form  of  a theorem  which  I owe  to 


P.  Fife) ; Let  u^  resp.  u2  be  bounded  solutions  of  (2.1) 
in  the  open  regions  resp.  > where  fi2  is  bounded, 


]R  = 

u fi2  and  let  u2 | an  = 

u.  t _n  . Assume 

X 1 d db  n 

u to  be  a 

solution  of  (2.3)  in  ]R  X [0,T] 

with  u t_Q  1 max(Uj,U2)  , 

then 

u > max(u^,u2)  in  1R  X [ 

0 , T]  . The  proof 

proceeds 

along 

the  same  line  as  that  of 

proposition  2.1. 

Now, 

take  any  periodic  solution 

u^  and  another 

solution  u2 

near 

u^  which  intersects  u^ 

in  yQ  and  y^ 

such  that 

u2  > 

u^  holds  in  ( y Q , y ^ ) and 
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u2  < u ^ outside  [ y Q , y ^ ] then,  maxfu^Uj)  is  a lower 

bound  for  all  solutions  of  (2.3)  starting  with  some 

Uq  > maxlu^Uj)  . Hence,  u^  is  not  asymptotically  stable. 

Let  us  remark,  that,  if  f(u)  = -u  there  are  only  periodic 
solutions  for  X > 0 and,  according  to  the  preceding  argu- 
ment, they  are  all  asymptotically  unstable.  A usual  bifur- 
cation analysis  for  solutions  of  (2.1)  with  a fixed  period 
shows  that  all  branches  bifurcate  to  the  left,  whereas  for 
f(u)  = u , where  stable  solutions  exist,  they  bifurcate  to 
the  right.  All  results  remain  true  locally  near  u = 0 if 
f is  of  the  form  f(u)  = u (±1  + g(u,uy))  where  g(u,uy)  = 
° ( I u | + |uy| ) . 


3 . The  model  equation  for  a two-dimensional  strip 
This  case  already  contains  all  characteristics  of  some  cele- 
brated hydrodynamical  models,  so  that  a well  founded  conjec- 
ture about  the  set  of  all  small  solutions  is  possible.  We 
shall  show  that  their  character  changes  dramatically  while 
X passes  through  certain  critical  points.  Quasi-periodic 
solutions  appear.  Stability  properties  are  summarized  using 
the  "ad  hoc"  method  of  proposition  2.1.  Less  strong  results 
are  derived  using  a general  method  often  used  for  stability 
problems  in  hydrodynamics.  At  the  end  we  survey  possible 
extensions . 

Define  the  differential  expressions 

2 2 

L ( X ) = A + X , A = — + —^2 

3x  9y 


where 


N , (u , X)  = (A  + X ) u - f (u,u,u)  - ju  , 
j x y t 

f € C2(IR3,1R)  and  f(u,p,q)  = o(ju|  + 


j = 0,1 
I p I + lq| ) 


holds . 
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Consider  the  following  boundary  value  problem 

(3.1)  Ng (u,  X ) = 0,  u(0,y)  = u ( 1 , y ) = 0 

where  (x,y)  e [0,1]  X ]R  = fl  . A solution  which  is  periodic 
or  quasiperiodic  in  y is  simply  called  periodic  resp. 
quasi-periodic . (3.1)  always  possesses  the  trivial  solution 

u = 0 . We  are  interested  in  characterizing  all  small  non- 
trivial solutions. 

2 

Consider  the  linearized  equation  (A  + X)u  = 0 . For  X < it 

2 2 

all  nonzero  solutions  grow  exponentially,  for  X e (ti  ,4tt  ) , 

ker(A  + X)  is  2-dimensional  and  all  solutions  are  periodic, 

2 2 2 2 

for  X e (n  -rr  , (n+1)  tt  ) , n > 2 , ker(A  + X)  is  2n-dimension- 

al  and  all  solutions  are  quasiperiodic,  i.e.  they  are  of  the 

, ,0  o,  Or. .2  2.1/2  . 

form  u(x,y)  = v(x,u>^  y,...,wny),  ok  = IX-]  it  } ' , and 

v(x,  z.  , . . . , z ) is  2-n-periodic  in  every  z.  . (We  say:  v 

in  j 22 

is  27r-periodic  in  z)  . For  the  exceptional  points  X = n n 

the  2n-dimensional  space  ker(A  + X)  consists  of  solutions 

having  polynomial  growth. 


Hence,  the  spectrum  of  A + X consists  of  the  semiinfinite 
2 

interval  [it  , °°)  and  every  point  is  a nonisolated  eigen- 
value of  finite  multiplicity.  To  be  specific:  Define 
g^ (y)  = max(l,|y|  ),  k e IN  g , and  the  Banach-spaces 


X®  = {u  e HS 


loc 


(n) 


s , k 


sup {/  g,  2 I D lu  I 2du  }1//2  < “} 


a <s  £2 


X®  = (u  e X®  | u(0,-)  = u(l,«)  =0}  , s > 1/2 


. . S s 

and  consider  the  inductive  limits  X resp.  X : 


u xs 

k=0  k 


s _ 


O g 

Ek=0  Xk 


then , 

X < IT 

and 


2 

A 


°2  0 

A + X:  X -*■  X is 
whereas,  for  X e 
+ X is  onto.  (see 


a topological  isomorphism  for  all 
[n2  ft2  , (n+1)  2tt2)  , dim  ker(A+X)=2n 
[18]  ) . 
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The  question  arises  under  which  conditions  the  qualitative 
picture  is  maintained  for  small  nonlinear  perturbations.  As 
a first  step  we  formulate  a uniqueness  result.  Define 

1 

u (y)  = /2  / u (x , y)  sin  virx  dx 

^ n 


if 


~ 0 

P^u  = l sin  vttx  (u^(O)cos  w^y  + — ^ 

v wv 

X e (n2n  2 ' (n+1)  2tt2  ) and 


u (0)  sin  oj^y  } 
v ,y  v* 


P u = P ,u  + sin  nix  (u  (0)  + u (0)y) 
n n-1  n n,y 

if  X = n2TT2.  Set  u° (x ,y)  = u (x ,y+o)  and  = [0,1]  X [-1,1]  . 

2 2 2 2 

Proposition  3.1.  Let  X e [n  i , (n+1)  tt  ) , n e IN  . 

Then,  there  exists  a number  e > 0,  such  that,  if  u and 
v are  solutions  of  (3.1)  satisfyinq  p u = P v and 

SUP  II  u°ll  o < e'  supH  v°  II  p < e » 

OelR  H (K1)  aeJR  n (K^ ) 

it  follows  that  u = v . 

2 

Moreover,  for  X < tt  , u = 0 is  an  isolated  solution  of 
02 

(3.1)  in  X 

In  contrast  to  the  one-dimensional  case  it  is  here  by  no 
means  trivial  that  the  behavior  at  infinity  can  be  controlled 
by  the  trace  of  u in  ker(A  + X)  since  many  solutions  of 
exponential  growth  are  present.  The  proof  goes  far  beyond 
the  frame  of  this  survey;  we  refer  the  interested  reader  to 
[18]  . 


As  the  preceding  section  suggests,  existence  cannot  be 
proved  without  further  symmetry  properties  of  f . Hence  we 
make  one  of  the  following  two  assumptions 

a)  f (u,p,-q)  = f (u,p,q) 

(3.2) 

b)  f(u,p,-q)  = -f (u,p,q) , f(-u,-p,-q)  = f(u,p,q) 
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2 2 

Proposition  3.2.  Let  be  A e (tt  , 4tt  ) and  assume  that  f 

satisfies  (3.2)  a)  or  b).  Then,  there  is  a constant  c ( A ) 

such  that,  for  every  u e ker(A  + A)  with  sup  |u|  < c(A) 

fi 

there  exists  a periodic  solution  of  (3.1)  with  P^u  = u . 


Together  with  the  preceding  proposition  we  conclude  that,  for 
2 2 

A e (tt  , 4 tt  ) all  small  solutions  (small  in  the  sense  of 
prop.  3.1)  are  periodic.  Actually,  having  constructed  the 
solutions  u^  which  are  even  in  y,  every  other  solution 
can  be  obtained  by  a simple  translation  u°  = u ( • , • +o)  . 


We  indicate  the  proof.  Set  u(x,y)  = v(x,wy),  and  consider 
the  auxiliary  problem 


(3.3) 


(D(w)  + A)v  - F(w,v)  = 0 

v(0,z)  = v(l,z)  =0,  v 2iT-periodic  in  z 


where  D(oo)H  92/3x2  + aj292/3y2,  F(a),v)  = -(v'vx'wvz)  . 

Solve  (3.3)  by  the  method  of  Liapunov-Schmidt . Set 

1 izi 

<p^(x,z)  = — sin|j|irx  • e ^ , j = ±1 
/tt 


z!  = "Z-l  = Z 


and  define 


v = vi  + v2,  vL  = 


J.  c *- 

ljl=l  3 


whe 


re  v2  is  in  the  orthogonal  complement  of  v^  then,  v 


is  a function  of  the  c.'s  . Using  property  (3.2)  one  is 

) T T 2 2 

able  to  prove  that  NL(t,c)  = (F(t  ,v1+v2)  ,<p J)  /cJ  , t = w - u)Q  , 
is  a CP  ^-function  near  t = 0,  £ = 0 and  that  (0,x)  = 0, 
NL (t,c)  = N_t (t,c)  . Therefore,  the  bifurcation  equations 

read  for  c * 0 


- N j (c , t ) = 0 , 


= 1 


and  thus  have  locally  a 


,P-1 


solution 


t (c)  . 
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Together  with  Proposition  3.1  we  have  thus  a complete  des- 
cription of  all  small  solutions  in  the  parameter  interval 

2 2 2 
< 7T  ,4tt  ) . When  A approaches  tt  from  the  right,  singular 

solutions  exist  if  a certain  geometric  property  is  met.  The 

result  is  literally  the  same  as  that  given  in  Proposition 

2.2,  if  H2(-l,l)  resp.  H2qc(R)  is  replaced  by  H2(K^) 

2 2 
resp.  H.  (ft)  . When  A passes  through  4 tt  a new  situa- 

2 2 

tion  arises.  For  fixed  A e ( 4 tt  , 9tt  ) we  have  two  fre- 

0 2,1/2  0 ..  . 2,1/2  . . , 
quencies  w.  = ( A— tt  ) , = ( A — 4 tt  ) which,  in  gener- 

1 1 0 0 
al,  are  rationally  independent,  i.e.  n^w^  + n2w2  = ®'n^'n2 

c TL , implies  n^  - = 0 . However,  since  n^w^  + n2u>2 

can  be  made  arbitrarily  small,  small  denominators  appear 
when  differential  operators  are  inverted.  Formally  the  con- 
structive method  of  Liapunov  and  Schmidt  used  above  for  the 
periodic  case  can  be  carried  through  if  f is  analytic. 


Hence,  let  us  assume  that  f is  ahalytic  in  its  variables 
and  suppose  that  one  of  the  conditions  (3.2)  a)  or  b)  holds. 
Set  u(x,y)  = v(x,w^y, # where  v(x,z^,Z2)  is  2 tt  — 
periodic  in  z . Then,  problem  (3.1)  is  transformed  into 


(3.4) 


(D(m)  + A)  - FU,v)  = 0 

v(0,z)  = v(l,z)  =0,  v 27T-periodic  in  z 


where  D(w)=  32/3x2  + \ oj  .w.  8z/3z  . 3z  . , 

j,k=l  3 * 33 


F(w,v)  = f(v,v  , l oj.v  ) 

X j=l  ] zj 

Suppose  that  A c (4tt2,9tt2)  is  such  that  u®  and  are 

rationally  independent.  Set  T2  = (0,1)  X (0,2tt)  X ( 0 , 2 tt  ) , 

t = a)2  - oo9 2 and  define  the  spaces 
0 2 

Hj  li  (v  e L (ft)  | v 2n-periodic  in  z^ 

H2  = {v  £ H2 (ft)  n H°  I v ( 0 , • ) = v ( 1 , • ) = 0) 
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Decompose  v as  follows:  v = + v ^ , where 

e ker(D(w°)  + X)  and  e C°  c h# , C°  = ker (D (w® ) +X)1  ' 

Then,  D(w^)  + X:  H#/ker  (D  (u/3)  + X)  ■+ C°  is  one  to  one,  has 
dense  range,  but  is  not  continuously  invertible.  Hence,  the 
method  of  Liapunov-Schmidt  fails.  Yet,  there  is  a formal 
power  series  solution  of  the  form 


V2  = 


| a | >0 
I B I >2 


81  B2  ai  a2 
C-,  Cn  i w 
1 2 1 2 a , 6 


with 


v,  = 


/ c ,<fi 

I j 1=1  3 


/2 


iz  . 


<p.(x,z)  = — sin  i n x e 

j — it  11 


j = ±1,  ±2,  z_j  = -z j , c_j  = Cj  . It  is  easily  seen  that 
there  is  a unique  solution  w for  every  a,B  • However, 

0t  f p 

the  proof  of  convergence  seems  to  be  quite  hard  since  small 
denominators  occur.  Apart  from  the  convergence,  the  system 
of  equations  determining  v^  can  be  solved  quite  similarly 
as  in  the  periodic  case.  Using  property  (3.2)  one  is  able 
to  prove  that  N . (t_/£)  = (P  (t_,  v.  +v„  ) 3 ) /c  . , |j|  = 1,2,  is 

an  analytic  function  near  = 0,  c = 0,  satisfying 
N_ j = Nj (£,£)  • Then,  if  £ * 0,  the  bifurcation 
equations,  determining  v-^ , read: 

^ - Vi.si  * °-  i * ll2 

with  Nj(0,0)  = D,Nj(£»£)  = 0 . Hence,  a unique  analytic 

solution  t_ ( £)  exists  near  c = 0 (c.f.  (18]).  A similar 

2 2 2 2 

analysis  holds  for  X e (n  it  , (n+1)  it  ) , n > 2 . Together 
with  Proposition  3.1  it  supports  the  following 

2 2 2 2 

Conjecture  3.3.  Let  be  X e (n  tt  , (n+1)  ir  ) , nc  IN,  n > 2 . 
Set  oj9  = (X-j  n ) ■*'/2  and  suppose  that  o)^,...,o)^  are 
rationally  independent.  Assume  that  (3.2)  a)  or  b)  holds 
for  f which  is  real  analytic  near  0 . Then,  there  is  a 
positive  constant  c(X)  such  that  for  every  u € ker(A+X) 
with  sup|u|  <c(X)  there  exists  a guasiperiodic  solution 
Qi  (3.1)  with  Pnu  = u . 


I 


' 
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Very  likely,  the  assumption  of  rational  independence  is  not 
sufficient  and  stronger  non-resonance  conditions  have  to  be 
imposed  (c.f.  [21]).  It  is  of  interest  that  - at  least 

formally  - this  method  gives  a systematic  approach  for  the 
determination  of  all  solutions. 

Stability  and  instability  can  be  studied  with  the  same 
method  as  in  the  preceding  section.  Proposition  2.3  is 
still  valid  and  hence,  we  conclude  from  Proposition  2.4,  that 
periodic  and  quasiperiodic  solutions  u-^  are  not  asymptoti- 
cally stable  if  a solution  u^  exists  satisfying  uQ(x,y)  > 

u. (x,y)  for  (x,y)  e (0,1)  X ]R  . a bifurcation  analysis 
1 2 

near  A = it  shows  that,  if  the  leading  terms  of  f are 
2 3 

au  + bu  , ab  > 0,  a + b > 0,  the  even  solution,  starting 
2 

in  u = 0,  A = it  , majorizes  all  small  periodic  solutions. 

But  for  the  proof  of  this  local  result  one  does  not  really 
need  the  special  structure  of  our  model  equation.  A well 
known  argument  from  hydrodynamical  stability  theory  shows 
that  a periodic  solution  bifurcating  in  (0,A(w  ))  cannot 
be  stable  if  A (u>  ) is  not  a local  minimum  of'  A ( ui ) (c.f. 

[27],  [16],  [19]).  We  shall  briefly  indicate  this  argument 

tu.'  the  present  case. 


2 2 2 

Let  iDg  satisfy  ti  < < 4 tt  ' and  consider  = uig/n  for 

2 2 2 2 

some  large  integer  n . Then  A^  = n + < ti  + = Aq 

holds.  Fix  let  A vary  and  confine  the  bifurcation 

analysis  of  (3.1)  to  solutions  which  are  271/w^-periodic  in 
y . The  solution  (u,A)  emanating  from  ( 0 , A ^ ) , which  has 
2tt/o)q  as  irreducible  period,  lies  in  this  larger  class  too. 
The  principle  of  linearized  stability  is  well  established 
for  the  problem  under  consideration  (c.f.  [15]).  Hence,  the 

spectrum  of  JC  = (A  + A)  - f 1 (u)  determines  stability  and 
instability  of  u . However,  for  A = An,  u = 0,  £ has  the 

positive  eigenvalue  (n  -l)u)g/n  . Since  the  eigenvalues  vary 
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continuously  with  u and  X , there  is  a full  neighborhood 
u = 0,  X = XQ  with  a positive  eigenvalue  of  £ . Hence,  all 
solutions  bifurcating  in  (0,X(ui®))  where  X(to^)  is  not  a 
minimum  of  X(u>)  are  unstable  near  the  bifurcation  point. 

At  the  end  of  this  section  we  transfer  our  results  to  the 
Taylor  problem  of  hydrodynamical  stability.  The  viscous  flow 
between  two  rotating  coaxial  circular  cylinders  of  infinite 
length  bifurcates  into  a space-periodic  stationary  motion 
(the  Taylor  vortices)  if  the  Reynolds-number  surmounts  a 
certain  critical  value  Xcr  . There  are  three  coupled  equa- 
tions for  the  three  velocity  components  describing  this  phen- 
omenon. They  are  of  the  type  (3.1)  if  rotationally  symmetric 
solutions  are  considered  (f  is  purely  quadratic).  The 
parameter  w denotes  the  wavenumber  of  the  vortices  ([6]). 
Ample  numerical  evidence  shows  (no  proof  is  known)  that  the 
neutral  curve  X (to)  is  strictly  convex  with  X -*•  « for 
to  -*  0 and  for  oj  -+  °°  . There  is  a unique  wavenumber  <ocr 
determined  by  X(wcr)  = ^cr  • Quite  a number  of  experiments 
indicate  that,  for  X > Xcr,  wavenumbers  ranging  in  a "band" 
around  w can  be  realised.  The  wavenumber  actually  ob- 
served  depends  e.g.  on  the  way  the  stationary  state  is 
reached.  ( [28] ) . 

For  values  of  X slightly  beyond  X there  are  exactly  two 

wavenumbers  with  X = X(ok),  which,  for  almost  all 

X,  are  rationally  independent  ([13]).  Hence  we  may  conjec- 
ture that  in  this  region,  all  small  rotationally  symmetric 
solutions  of  the  Taylor  problem  are  quasiperiodic. 

Some  indications  in  this  direction  are  found  [9]  and  in 
DiPrima,  Eckhaus  and  Segel  [8].  They  show  that,  for  X 
slightly  larger  than  ^cr>  the  full  system  of  the  Navier- 
Stokes  equations  can  be  formally  solved  by  vortex  flows 
having  arbitrarily  large  periods.  If  the  amplitudes  of  the 
harmonic  modes  are  chosen  carefully  enough  one  can  construct 
solutions  by  superimposing  periodic  flows  with  wavenumbers 
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regularly  distributed  in  the  amplified  interval  of  the  lin- 
earisation. Although  this  interval  is  bounded  below  by  a 
positive  number  the  resulting  solution  has  an  irreducible 
period  which  corresponds  to  the  difference  of  two  neighbor- 
ing modes  and  thus  is  arbitrarily  large. 

The  role  of  the  singular  solution  in  our  models  is  played  by 
the  vortex  flow  with  wavenumber  a>  for  the  Taylor  problem. 
According  to  the  preceding  argument  it  is  the  only  possible 
stable  solution  near  X . We  have  already  mentioned  that 
- as  experiments  show  - for  larger  values  of  X a band  of 
wavenumber  seems  to  be  stable.  Our  preceding  stability 
argument  implies  the  existence  of  a strip  of  unstable  wave- 
numbers  inside  the  neutral  curve  and  tangent  to  it  at 
w = wcr  . Kogelman  and  DiPrima  [19]  determined  this  region 
quantitatively.  However,  the  theoretical  and  experimental 
data's  are  still  far  apart.  Moreover,  nothing  is  known 
about  regions  of  attractions  for  stable  solutions. 

In  the  light  of  the  analysis  presented  here  another  problem 
arises.  Up  to  now  the  theoretical  and  experimental  interest 
was  concentrated  on  space-periodic  solutions  of  the  Taylor 
problem.  However,  they  very  likely  form  a small  subset  of 
all  solutions  which  are  quasiperiodic . Naturally  it  will  be 
almost  impossible  to  decide  experimentally  whether  a given 
solution  is  periodic  or  quasiperiodic.  Most  of  these  methods 
determine  phenomenologically  the  periodicity  of  the  flow  by 
counting  the  observed  number  of  vortex  cells.  However,  an 
experimental  decision  whether  a purely  periodic  motion  oc- 
curs or  not  must  include  an  exact  determination  of  the  stag- 
nation planes.  It  seems  that  for  this  point  numerical  ex- 
periments simulating  the  selection  properties  of  nature 
could  be  of  great  value. 
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4 . The  model  equation  in  the  plane 

In  this  section  we  discuss  equation  (3.1)  when  ft  is  the 
2 

whole  of  1R  . A full  understanding  of  the  qualitative 
structure  of  solutions  and  of  their  stability  would  shed  new 
light  on  the  famous  Benard-problem  which  describes  the  self 
excited  convective  motion  in  a viscous  fluid  layer  heated 
from  below  (c.f.  [6]).  Although  a classical  problem  of 

fluid  dynamics,  it  has  recently  attracted  interest  of  var- 
ious other  fields  of  science  such  as  geophysics  [5] , of 
Laser-theory  [11] ; for  other  applications  see  [24]  . Despite 
the  long  history  and  the  strong  efforts  to  explain  the 
appearance  of  a definite  cellular  instability  and  despite 
the  beautiful  results  of  Busse  [2]  and  Iudovich  [12]  , our 
systematic  knowledge  of  it  is  still  fragmentary,  to  say  the 
least.  In  an  important  series  of  papers  D.  Sattinger  has 
recently  shown  that  by  group  theoretic  methods  the  bifurca- 
tion equations  can  be  classified  according  to  the  invariance 
properties  of  cell  patterns  considered  [25]  and  that  these 
classifications  can  be  used  to  describe  pattern  preference 
by  stability  arguments  among  the  set  of  doubly  periodic 
solutions  [26] . 

Let  us  return  to  our  model  equation 

(4.1)  Nq(u,A)  = 0 

2 

which  now  is  considered  in  ft  = ]R  . The  linearized  equa- 
tion (f=0)  has  only  solutions  of  exponential  growth  if 
A < 0;  for  X = A > 0 one  concludes,  using  Fourier  trans- 
form, that  u - as  a tempered  distribution  - has  support  on 
the  circle  |x|  = A^  and,  according  to  the  Paley-Wiener 
theorem  can  be  extended  to  an  entire  function  ( [29]  ,p. 305) . 
Hence  u can  be  represented  as  follows 

(4.2)  u(x)  = / c(u>)  e*-  - do 


0 
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where  c(to)  is  an  integrable  function  and  do  denotes  the 

2 

line  element  of  the  unit  circle  in  ]R 

The  first  step  of  a systematic  analysis  should  prove  that 
every  small  solution  has  a unique  trace  in  ker(A  + X)  de- 
termined by  (4.2).  We  are  not  yet  able  to  give  such  a proof. 
Up  to  now  there  is  also  little  hope  to  construct  sufficiently 
many  solutions  of  (4.1)  - even  formally  - "covering"  the 
whole  of  ker(A  + X)  . However,  confining  attention  to 
finite  Riemann-sums  approximation  of  (4.2)  one  can  proceed  as 

in  the  previous  sections.  Set  w . = (a.,3-),  I to . I = a . + 

2 “3  3 3 1 -J  1 3 

$.  = Xn,  Xn  >0  fixed,  to  = (to,,..., to  ),  and  let 
] 0 u — — i — n 

u (x,y)  = (a1x,...,anx,81y,...,B  y) 

where  v(z^,...,z  , £ ,..., £n)  is  27T-periodic  in  every  z. 
and  £ ( 2ir-per iodic  in  z and  £ ).  Then  (4.1)  is  trans- 

formed in  the  following  boundary-value-problem  for  v : 


(4.3) 

where 


(D(oo)  + X)v  - F(io,v)  = 0 


DU)  = 1 (ajBk  32/3z.3zk  + 6 ■ 8k32/3  C j 3CR) 

j » k=l 


and 


n n 

F (to , v)  = f ( v , l ot.v  . , l 8 • v . ) . 

j = l 11  Z]  j = l 3 (’3 


We  treat  (4.3)  as  a bifurcation  problem  with  the  2n-dimen- 
sional  vector  to  as  the  bifurcation  parameter  varying  near 

<o°,  l<*ul  = ' 3 = 1 » ■ • • * n . The  method  of  Liapunov- 

3 u 0 

Schmidt  is  formally  applicable  if  to  and  f satisfy  cer- 
tain conditions.  The  condition  imposed  on  wu  states  that 


(4.4) 


n 

( l 

j = l 


0,2  , 
n . a . ) + 

D D 


n 

( l 

j=i 


m.B°)2  = Xo' 


nj'mk 


e 2 
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implies 

dex.  It 

0,  , 
w . s of 
3 


(n  . , m . ) = 0 
3 3 

corresponds 


with  the  exception  of  exactly  one  in- 
to the  rational  independence  of  the 


the  last  section. 


The  condition  imposed  on  f requires  that  one  of  the  follow- 
ing alternatives  should  hold 


a) 

f (u , -p , -q ) = f (u,p,q) 

(4.5) 

b) 

f (u , -p , -q ) = - f (u , p , q ) 

f(-u,-p,-q)=  f (u,p,q) 

Given  (4.4)  , 

(A  + 

X)v  = 0 implies 

where  c . = c.,  z . = -z  . , L . = -t  . • The  formal  analysis 

~3  3 ~3  3 ~3  3 

of  the  previous  section  can  be  carried  out  for  this  quasi- 

periodic  case  too  and  we  obtain:  If  the  conditions  (4.4) 

and  (4.5)  are  satisfied  then  (4.3)  has  a formal  power  series 

solution  (v, x, , . . . ,t_ ) (c, , . . . ,c  ) where  t.  is  defined  bv 
— i n l n i — *- 

20^  20^ 
a.  - a”  = 8.  - BU  • 

3 3 3 3 


We  are  not  yet  able  to  prove  this  assertion  since  obviously 
problems  of  small  denominators  arise.  In  addition,  we  are 
not  sure  whether  this  approach  provides  all  small  solutions. 

For  comparison  let  us  shortly  review  the  methods  of  con- 
structing solutions  of  the  Benard-problem.  The  most  general 
is  the  one  of  Schliiter-Lortz-Busse  [27]  first  applied  in 
Busse's  dissertation  [2]  where  u and  X are  expanded  in  a 
power- series  of  e,  a formal  parameter  measuring  the  ampli- 
tude of  u . While  X varies,  |uk|  is  fixed  at  the  point 
u>cr  where  the  neutral  curve  X(|w|  ) attains  its  unique 
minimum  (the  neutral  curve  X ( | m | ) being  the  graph  of  the 
smallest  eigenvalue  of  -A  for  doubly  periodic  solutions 
e , with  w = (a, 8);  as  for  the  Taylor  problem  it 
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is  numerically  known  to  be  strictly  convex) . 

Busse  shows  that  his  approach  is  consistent  with  the  so- 
called  Boussinesq-approximation  if  the  lattice  of  the  w, 
is  semiregular,  i.e.  the  angle  between  two  neighboring  um 
alternates  between  two  fixed  values.  In  general  these  solu- 
tions are  not  doubly  periodic  and  again,  as  Joseph  [14] 
pointed  out  already,  small  denominators  have  to  be  handled 
if  the  method  should  be  justified.  Evidently,  all  regular 
cell  patterns  such  as  rolls,  squares  and  hexagons  are  con- 
tained in  Busse' s approach.  For  these  cases  however,  proofs 
of  existence  are  known,  analytical  as  well  as  topological 
([12],  [23],  [10]).  Our  approach  however  seems  to  be  more 

general  as  the  previous  ones,  and  the  results  of  the  pre- 
ceding sections  even  nourish  the  hope  that  they  eventually 
yield  all  solutions. 

Stability  and  instability  for  the  model  equation  is  under- 
stood again  with  respect  to  solutions  of  the  Cauchy  problem 
N^(u,A)  = 0,  U|t_Q  = u . Observe  that,  for  f(u,p,q)  = 

2 

u + higher  order  terms,  there  is  a constant  positive  solu- 
tion bifurcating  in  u = 0,  A = 0 to  the  right.  Hence,  in- 
voking Proposition  2.4  we  conclude  that,  for  A > 0,  among 
all  bounded  solutions  in  some  neighborhood  of  0 (depending 
on  A) , the  constant  solution  is  the  only  asymptotically 
stable  one. 

For  the  Bdnard  problem  the  role  of  this  distinguished  solu- 
tion is  played  by  the  motions  with  |w|  = wcr  . As  for  the 
Taylor  case  one  concludes  that  the  graph  of  A(|w|)  bounds 
a region  of  unstable  pairs  (|w|  ,A)  below.  Among  the 

semiregular  solutions  of  Busse,  the  rolls  are  the  only  stable 
ones  if  the  Boussinesq  approximation  is  considered.  In  [27] 
an  argument  is  given  that,  if  the  first  order  term  in  the 
expansion  of  u contains  more  than  one  harmonic  mode  (n  > 1) 
then  the  linearisation  of  (4.1)  about  u contains  a positive 
eigenvalue  in  its  spectrum.  The  situation  for  generalisa- 
tions of  the  Boussinesq-model  is  somewhat  more  complicated. 
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The  interested  reader  is  referred  to  [3] . 

If  one  accepts  the  principle  of  linearized  stability  (no 
proof  is  known  in  the  present  case)  then  the  results  of 
Busse  suggest  a strong  preference  for  roll  patterns.  Ac- 
cording to  (27)  only  those  rolls  can  be  stable  for  which 
| o.|  > u>cr  . The  stability  regions  for  rolls  in  a wide  range 
of  Rayleigh-numbers  has  been  determined  numerically  in  [4] . 
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Mathematical  Problems  of  Dynamo 
Theory 
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1 . Introduction 

Dynamo  theory  is  concerned  with  those  solutions  of 
the  magnetohydrodynamic  equations  with  finite  magnetic 
fields  which  bifurcate  from  solutions  without  a magnetic 
field.  Expressed  in  physical  terms  this  bif urcatation  phe- 
nomenon may  be  described  in  the  following  way:  Velocity 
fields  without  an  electric  current  can  be  realized  in  an 
electrically  conducting  fluid  as  in  any  other  fluid. 
Electrically  conducting  fluids,  however,  offer  an  addi- 
tional degree  of  freedom  in  that  they  can  support  electric 
currents.  Dynamo  theory  deals  with  those  instabilities  in 
which  velocity  fields  with  electric  currents  arise  spon- 
taneously from  velocity  fields  without  electric  currents. 
The  instability  mechanism  is  referred  to  as  the  dynamo 
process . 

Since  magnetohydrodynamics  can  be  considered  as  an 
extension  of  hydrodynamics  to  the  case  of  electrically 
conducting  fluids,  .dynamo  theory  may  be  regarded  as  an 
extension  of  the  theory  of  hydrodynamic  instability  and 
turbulence.  Indeed,  dynamo  theory  parallels  the  latter  in 
many  respects  and  the  same  mathematical  methods  are  used 
in  both  subjects.  The  dynamo  problem  differs,  however, 
from  the  problem  of  hydrodynamic  stability  in  that  the 
bifurcating  solution  introduces  a new  physical  quantity, 
the  magnetic  field,  which  is  foreign  to  ordinary  hydro- 
dynamics. It  is  this  phenomenon  which  has  made  dynamo 
theory  a particularly  fascinating  subject  of  continuum 
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physics . 

Dynamo  theory  has  developed  fairly  recently.  It  was 
not  until  1958  that  Backus  []2[]  and  Herzenberg  [22] 
showed  convincingly  that  the  dynamo  process  of 
generation  of  magnetic  fields  in  a homogeneous  electri- 
cally conducting  fluid  is  possible.  Traditionally  there 
has  been  little  interest  in  the  subject  among  fluid  dynami- 
cists.  Since  it  is  difficult  to  achieve  magnetic  Reynolds 
numbers  in  the  laboratory  which  are  sufficiently  high  to 
provide  dynamo  action  and  since  it  is  unlikely  that  the 
homogeneous  dynamo  will  ever  have  technical  applications, 
there  has  been  little  reason  for  engineers  to  get  involved 
in  the  subject. 

On  the  other  hand  the  dynamo  process  plays  a promi- 
nent role  in  geophysical  and  astrophysical  applications. 

The  origin  of  geomagnetism  and  the  mechanism  of  the  solar 
magnetic  cycle  have  long  been  puzzling  problems  to 
natural  philosophers.  In  1919  Larmor  [25]]  was  the  first 
to  propose  the  dynamo  process  as  the  cause  for  the  genera- 
tion of  solar  magnetic  fields.  Cowling  [JL7]  made  an 
important  contribution  in  1934  by  demonstrating  that  axi- 
symmetric  and  two-dimensional  magnetic  fields  cannot  be 
generated  by  the  dynamo  process.  Cowling's  theorem  cast 
severe  doubts  on  the  dynamo  hypothesis  of  the  origin  of 
geomagnetism  since  the  observed  magnetic  field  of  the  earth 
is  nearly  axisymmetric . Only  the  absence  of  any  viable 
alternative  led  Elsasser  [18]  and  Bullard  []6[]  to  pursue 
the  dynamo  hypothesis  further.  Today  we  know  that  nearly 
all  velocity  fields  give  rise  to  dynamo  action  if  their 
amplitudes  are  sufficiently  high  and  that  the  generation 
of  magnetic  fields  with  minimal  deviations  from  axi- 
symmetry  is  possible.  Since  the  doubts  of  whether  the 
dynamo  process  in  the  liquid  outer  core  of  the  earth  is 
feasible  have  disappeared,  the  attention  of  theoretical 
geophysicists  has  shifted  towards  the  construction  of  more 
or  less  realistic  models  of  the  geodynamo.  Many  basic 
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problems  of  dynamo  theory,  however,  remain  unsolved,  and 
it  will  be  an  objective  of  this  paper  to  point  out  those 
of  particular  mathematical  interest. 

Our  review  of  basic  aspects  of  dynamo  theory  starts 
with  the  derivation  of  the  dynamo  equation  in  section  2. 
Some  insight  into  the  dynamo  process  can  be  gained  by  con- 
sidering the  homopolar  or  disk  dynamo,  which  is  described 
in  section  3.  In  hydrodynamics  it  is  often  important  to 
avoid  instability  and  a number  of  sufficient  conditions 
for  the  stability  of  steady  fluid  flow  have  been  derived. 

In  dynamo  theory  the  bifurcation  of  a solution  with  a mag- 
netic field  is  considered  a positive  event  which  is  also 
referred  to  as  dynamo  action.  For  this  reason  criteria 
similar  to  those  in  hydrodynamics  assume  the  form  of 
necessary  conditions  for  bifurcation.  The  simplest  neces- 
sary conditions  for  dynamo  action  result  from  energy  inte- 
grals, which  are  discussed  in  section  4.  The  close 
analogy  to  energy  bounds  in  the  theory  of  hydrodynamic 
stability  becomes  apparent  at  this  point.  The  anti- 
dynamo theorems  in  section  5 represent  a more  unique 
aspect  of  dynamo  theory.  The  theorems  have  played  an 
important  role  in  the  historical  development  of  the  field 
and  some  of  the  unproven  conjectures  continue  to  pose 
challenging  mathematical  problems.  Because  of  the  exten- 
sive literature  it  is  not  possible  to  give  a survey  of  all 
solutions  of  the  kinematic  dynamo  problem.  The  discussion 
in  section  6 is  therefore  restricted  to  general  aspects 
and  a particularly  simple  example.  In  recent  years  the 
focus  of  theoretical  efforts  has  moved  towards  the  magneto- 
hydrodynamic dynamo  problem,  which  is  considered  in  section 
7 . The  complexities  of  this  problem  have  so  far  prevented 
the  discovery  of  any  general  result,  such  as  a general 
relationship  determining  the  equilibrium  amplitude  of  the 
magnetic  field.  The  variety  of  phenomena  exhibited  by  the 
few  existing  solutions  for  magnetohydrodynamic  dynamos 
indicates  that  much  more  work  will  be  required  before  a 
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comprehensive  understanding  of  this  problem  is  achieved. 
The  paper  closes  with  some  remarks  on  the  applications  of 
dynamo  theory . 

2 . The  Dynamo  Equation 

The  mathematical  description  of  the  dynamo  process  is 
based  on  Ohm's  law  and  on  Maxwell's  equations  in  the 
magnetohydrodynamic  approximation  in  which  the  displace- 
ment current  is  neglected.  This  approximation  is  well 
satisfied  in  all  problems  of  interest  since  in  general  the 
fluid  velocities  measured  relative  to  an  inertial  system 
are  small  compared  to  the  velocity  of  light.  It  is  con- 
venient to  eliminate  from  the  basic  equations  all  vari- 
ables but  the  magnetic  induction  jg. 


In  the  last  step  the  assumption  has  been  made  that  the 
fluid  is  incompressible,  with  the  consequence  that  a sole- 
noidal  velocity  field  can  be  assumed.  The  dynamo  problem 
has  been  almost  exclusively  considered  under  this  restric- 
tion. The  inverse  of  the  product  of  conductivity  a and 
permeability  p is  called  the  magnetic  diffusivity  X.  In 
deriving  the  second  form  (2)  of  the  dynamo  equation  it  has 
been  assumed  that  X is  constant.  It  is  evident  from  the 
first  form  (1)  of  the  dynamo  equation  that  ^ remains  sole- 
noidal  if  its  initial  conditions  satisfy  this  property. 

For  this  reason  Maxwell's  equation  V • Jg  = 0 is  required 
only  for  the  initial  conditions.  For  geophysical  and 
astrophysical  applications  it  is  of  interest  to  note  that 
the  dynamo  equation  is  invariant  with  respect  to 
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transformations  to  rotating  systems. 

In  the  context  of  the  dynamo  problem  it  is  important 
to  emphasize  the  boundary  condition  for  the  magnetic  field 
^ at  infinity 

(3)  |£|  = o(  | ^ |“ 3 ) for  \%\  •+  °° 

where  £ is  the  position  vector.  Any  field  with  a "Source" 
at  infinity  cannot  be  considered  a solution  of  the  dynamo 
problem.  A brief  discussion  of  the  boundary  conditions  at 
surfaces  bounding  the  electrically  conducting  fluid  is 
given  in  the  Appendix. 

The  dynamo  problem  has  traditionally  been  considered 
at  two  different  levels.  The  kinematic  dynamo  problem 
addresses  itself  to  the  question  of  under  which  conditions 
equations  (1)  or  (2)  allow  solutions  ^ that  are  growing  in 
time  when  an  arbitrarily  chosen  solenoidal  velocity  field 
% is  given.  Since  the  dynamo  equation  is  linearly  homo- 
geneous, the  amplitude  of  ^ cannot  be  determined.  When  a 
steady  velocity  field  is  assumed  the  growth  of  the  mag- 
netic field  is  exponential,  which  is  clearly  unphysical 
beyond  a finite  range  of  time.  In  reality  the  velocity 
cannot  be  prescribed  independently  of  the  magnetic  field, 
since  it  must  satisfy  the  equations  of  motion  in  the  pre- 
sence of  the  Lorentz  force.  Accordingly,  the  full  magneto- 
hydrodynamic dynamo  problem  is  described  by  the  nonlinear 
system  of  the  equations  of  motion  and  the  dynamo  equation. 
The  nonlinear  coupling  of  the  equations  reduces  the  growth 
of  the  magnetic  field  and  leads  to  an  equilibration  of  the 

magnetic  energy.  Since  the  Lorentz  force  is  proportional 
2 

to  1^1  , however,  there  is  always  a finite  range  in  which 
the  kinematic  dynamo  problem  correctly  describes  the 
growth  of  the  magnetic  field. 

Restricting  our  attention  to  the  stationary  aspects 
of  the  problem  and  using  the  amplitude  A of  the  velocity 
field  as  a parameter  we  may  use  the  language  of  bifurca- 
tion theory  to  describe  the  two  parts  of  the  dynamo 
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problem.  The  kinematic  dynamo  problem  determines  the 
points  of  bifurcation  An  at  which  magnetic  solutions 
bifurcate  from  the  purely  hydrodynamic  basic  state.  The 
nonlinear  post-bifurcation  behavior  of  the  magnetic  solu- 
tion can  be  described  only  by  the  coupled  system  of  the 
equations  of  motion  and  the  dynamo  equation. 

3 . The  Disk  Dynamo 

In  spite  of  the  simplicity  of  the  dynamo  equation  (2) 
it  was  not  known  until  about  20  years  ago  whether  non- 
decaying solutions  did  exist.  This  may  appear  surprising 
in  view  of  the  fact  that 
the  dynamo  principle  has 
long  been  used  for  the 
generation  of  electro- 
magnetic energy  from 
mechanical  energy.  The 
main  difference  between 
the  dynamo  process  in  a 
contained  homogeneous 
fluid  and  in  technical 
dynamos  is  that  the 
latter  depend  on  a 
multiply  connected  dis- 
tribution of  electrical 
conductivity.  Since 
the  self-excited  tech- 
nical dynamo  is  other- 
wise quite  similar 

to  the  homogeneous  dynamo  it  is  of  interest  to  consider 
the  disk  dynamo,  which  provides  the  simplest  example  of 
dynamo  action. 

As  shown  in  figure  1,  the  disk  dynamo  consists  of  a 
metal  disk  connected  to  a shaft  which  is  rotating  with  a 
angular  velocity  ft.  An  initial  weak  magnetic  field  Jg 
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induces  a radial  electromagnetic  force  in  the  disk  which 
leads  to  a potential  difference  U between  the  rim  and  axis 
of  the  disk.  This  force  is  used  to  drive  an  electric  cur- 
rent I in  a circuit  which  is  closed  in  such  a way  that  the 
magnetic  field  of  the  current  has  the  same  direction  as 
the  initial  magnetic  field.  When  the  resistance  R of  the 
circuit  is  low  enough,  the  possibility  for  a self-excited 
dynamo  exists.  Mathematically  the  process  is  described  by 

(4a)  LI + RI  = U 

(4b)  U = JLmib 

2 7T 

where  L is  the  inductance  of  the  circuit  and  M is  the 
mutual  inductance  between  the  loop  and  the  disk.  Equa- 
tions (4)  yield  for  I and  hence  for  the  associated  magnetic 
field  a solution  of  the  exponential  form, 

8 = Soexp{f?“R)t/L} 

Growing  solutions  exist  when  the  condition  for  self- 
excitation 

(5)  Mfi/R  > 2ir 

is  satisfied.  The  complex  distribution  of  conductivity  of 
the  disk  dynamo  permits  a simple  rotational  motion.  In  the 
case  of  the  homogeneous  dynamo  the  opposite  extreme  is 
realized.  A uniform  distribution  of  conductivity  requires 
a complex  velocity  field  for  dynamo  action.  In  both  cases, 
however,  the  condition  for  self-excitation  or  "bifurcation" 
yields  expressions  of  the  form  (5) . 

The  hydromagnetic  problem  for  the  disk  dynamo  is 
obtained  when  the  angular  velocity  is  no  longer  regarded  as 
constant  and  the  torque  exerted  by  the  Lorentz  force  on  the 
disk  is  taken  into  account.  The  earlier  stability  analysis 
of  this  problem  by  Bullard  [^7]]  has  recently  been  extended 
into  the  nonlinear  regime  by  Robbins  [42]]  using  a modified 
dynamo  suggested  by  Malkus  [28] . In  this  case  the  disk 
dynamo  exhibits  nonlinear  oscillations  and  aperiodic 
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reversals  similar  to  those  found  in  previous  work  [26,  2, 
36]  on  systems  of  two  or  more  coupled  disk  dynamos.  The 
time-dependent  properties  of  disk  dynamos  resemble  phe- 
nomenologically at  least,  the  excursions  and  reversals  of 
the  earth's  magnetic  field  indicated  by  the  paleomagnetic 
records . 


4 . Energy  Bounds 

In  order  to  obtain  necessary  conditions  for  the  exis- 
tence of  growing  solutions  of  the  dynamo  equation  (2)  we 
consider  a finite  domain  V of  conducting  fluid.  For 
simplicity  we  assume  that  the  magnetic  diffusivity  X of 

the  fluid  is  constant  and  that  the  outside  space  V'  is  1 

insulating.  Multiplication  of  equation  (1)  or  (2)  by 
and  integration  over  V + V'  yields 

(6)  151  ; l^l2  = -X/|Vx^|2  + /;e.  (^x  (VX^)) 

V + V'  V V j 

In  deriving  this  equation  it  has  been  assumed  that  the  ■ 

i 

normal  component  of  vanishes  on  the  surface  of  V and  ( 

that  V Xjg  = 0 holds  in  V'  . For  details  we  refer  to  Backus 
p[]  . Using  the  maximum  velocity  u of  |^|  in  V the 
inequality 


1 _d_ 

2 dt 


; |jg|*  < - x/ j v x g | 

V + V'  V 


2 


(7)  + u { / |^|2/|V  x %\2}* 

V + V'  V 
2 

is  obtained  from  (6) . Let  k be  the  solution  of  the  varia- 
tional problem 


(8)  k2  = L2min{/|  V * r|  2/  / |jp|2) 

V V + V' 

where  L is  the  diameter  of  V and  where  the  variation  is 
extended  over  all  solenoidal  fields  ^ that  decay  towards 
infinity.  Since  (8)  always  has  a finite  solution  we  can 
use  the  positive  number  k to  rewrite  inequality  (7) 


(9) 


1 _d_ 

2 dt 


/ 

V + V' 


/ | V X b|2U  - u L/k) 

v 


Thus  we  obtain  as  a necessary  condition  for  dynamo  action 
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that  the  magnetic  Reynolds  number  exceeds  the  value  k 
(Childress  [15] ) , 

(10)  R = u L/A  > k 

. m m 

Backus  []2[]  has  calculated  the  value 

k = 2tt 
s 

for  a spherical  volume  of  diameter  L which  can  serve  as  a 
lower  bound  for  all  possible  values  of  k. 

The  derivation  of  the  necessary  condition  for  dynamo 
action,  R^  > k,  is  essentially  the  same  as  for  the  energy 
stability  criterion  for  viscous  flow  [43] . As  in  the 
latter  case  the  inequality  (9)  can  be  replaced  by 

(11)  i i f |b|2<  - f | B | 2 { Ak2/L2  - m} 

V + V'  V + V' 

where  m is  the  maximum  of  the  largest  principal  value  of 
the  rate  of  strain  tensor  e . . = i(3 . u.  + 3 . u . ) in  V.  The 

i]  2 ] i l j 

criterion 

(12)  mL2/A  > k2 

for  dynamo  action  was  first  derived  by  Backus  []2[]  . The 
advantage  of  inequality  (11)  is  that  it  remains  invariant 
with  respect  to  a superimposed  rotation. 

The  fact  that  the  dynamo  equation  does  not  contain  a 
pressure  term  as  do  the  Navier-Stokes  equations  for  an 
incompressible  fluid  allows  the  derivation  of  more  speci- 
fic criteria.  The  following  criterion  turns  out  to  be 
useful  in  geophysical  applications 

(13)  \ - { " x + D max  ’ V } f 

V V + V' 

where  £ is  the  position  vector  and  D is  defined  by 

D={/|£|2/  / I - B | 2 } 55 

V + V' 

Although  the  criterion  for  dynamo  action  implied  by 
inequality  (13)  depends  on  the  magnetic  field,  it  is 
applicable  to  the  earth's  core  since  upper  bounds  on  D are 
available  independently  [10] . 

Criteria  of  the  form  (10)  or  (12)  are  not  really 
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independent  of  the  magnetic  field  either,  since  the 
Lorentz  force  may  change  the  maximum  velocity  consider- 
ably. It  is  desirable  therefore  to  derive  criteria  which 
are  given  in  terms  of  forces  acting  on  the  fluid  or  veloc- 
ities prescribed  at  the  boundaries.  Such  criteria  have 
been  derived  in  the  case  of  hydrodynamic  stability  theory 
by  Payne  []33,  34]  . The  extension  to  magnetohydrodynamics 
appears  to  require  new  techniques  and  represents  a chal- 
lenging mathematical  problem. 

5 . Anti-Dynamo  Theorems 

The  most  famous  theorem  of  dynamo  theory  is  Cowling' s; 
which  states  that  axisymmetric  and  two-dimensional  magnetic 
fields  cannot  be  steady  solutions  of  the  dynamo  equation 
(2) . The  original  proof  by  Cowling  [17]  has  been  extended 
by  Backus  and  Chandrasekhar  [[3[]  to  include  toroidal  fields 
and  by  Lortz  [27]  to  allow  for  variations  of  magnetic 
dif fusivity . 

Because  of  Cowling's  theorem  it  was  doubtful  for  a 
long  time  whether  the  dynamo  hypothesis  of  the  origin  of 
the  earth’s  magnetic  field  is  feasible.  Later  work,  in 
particular  the  work  of  Braginskiy  [4,  5],  has  demonstrated 
that  magnetic  fields  with  arbitrarily  small  deviations  from 
axisymmetry  can  be  generated  by  the  dynamo  process  if  the 
magnetic  Reynolds  number  is  high  enough.  Since  Cowling's 
theorem  is  often  misquoted  in  the  literature  it  should  be 
emphasized  that  it  does  not  state  that  axisymmetric  and 
two-dimensional  velocity  fields  cannot  generate  magnetic 
fields.  In  fact,  these  velocity  fields  provide  the  most 
convenient  way  to  obtain  solutions  for  the  dynamo  problem. 

A theorem  which  is  probably  more  important  than 
Cowling's  theorem  from  the  practical  point  of  view  is  the 
toroidal  theorem.  It  is  implicitly  contained  in  an  early 
paper  by  Elsasser  [18[]  and  was  proven  rigorously  by 
Bullard  and  Gellman  []8[]  . This  theorem  is  based  on  the 
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property  that  any  solenoidal  vector  field  ^ can  be  written 
in  the  form 

(14)  Y = V x (V  x £<),)  + V x ^ 

with  arbitrary  scalar  functions  <|>  and  ip . The  first  term 
on  the  right  hand  side  of  (14)  is  the  poloidal  component 
of  the  velocity  field,  the  second  term  is  the  toroidal  com- 
ponent. The  toroidal  theorem  states  that  a velocity  field 
with  a vanishing  poloidal  component  cannot  give  rise  to 
steady  or  growing  solutions  of  the  dynamo  equation  (1)  if 
the  magnetic  diffusivity  A is  a function  of  |^|  only. 

Using  the  center  of  the  earth  as  the  origin  for  the  posi- 
tion vector  the  theorem  can  be  directly  applied  to  the 
earth's  core.  It  implies  that  for  the  geodynamo  a non- 
vanishing radial  velocity  is  required  since  the  latter  is 
associated  only  with  the  poloidal  field  of  motion. 

The  toroidal  theorem  can  also  be  formulated  for  a 
plane  geometry  using  the  general  representation 

(15)  y * v x (V  x Jc^)  + V x }cip 

for  a solenoidal  velocity  field  where  ^ denotes  a constant 
unit  vector.  The  theorem  states  that  a velocity  field  of 
the  form 

(16)  Y = V x ^ 

cannot  give  rise  to  dynamo  action  if  A is  a function  of 
• y only.  The  proof  is  sufficiently  simple  that  it  can 
be  described  in  a few  lines.  Using  the  representation 

$ = v x (V  X j£h)  + v X J^g 

for  the  magnetic  field  we  obtain  by  multiplying  equation 
(1)  by  £ 

(17)  (j£-  AV2)  A2h  = y ’ VA2h 
where  A2  is  defined  by 

a2  = V2  - (^  • V)2 

Equation  (17)  represents  the  heat  equation  with  A2h 
replacing  the  temperature.  The  variation  of  A corresponds 
to  variations  of  the  heat  capacity  rather  than  the  heat 
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conductivity.  Since  there  are  no  "heat  sources",  A2h  and 
consequently  the  poloidal  component  of  ^ must  decay. 
Assuming  that  this  decay  has  taken  place  we  obtain  an 
equation  for  g by  taking  the  ^-component  of  the  curl  of 
equation  (1) 

(18)  *2{Jt  + % ' V+  Xv2)g  = 0 

where  terms  involving  h have  been  neglected.  Using  the 
requirement  that  g must  be  bounded  at  infinity  we  obtain 
by  integration 

(19)  (^+  Y * V + Xv2)g  = f (Jj  • £) 

Without  affecting  the  magnetic  field  we  can  assume 

g = 0 

where  the  bar  indicates  the  average  over  any  plane  normal 
to  )c.  By  taking  this  average  over  equation  (19)  and 
using  ^ • Jc  = 0 we  find  f (J^  • £)  =0.  Thus  (19)  becomes 
identical  to  the  homogeneous  heat  equation,  which  yields 
only  decaying  solutions  for  g. 

The  proof  depends  critically  on  the  separation  of  the 
equations  for  the  poloidal  and  toroidal  parts  of  the  mag- 
netic field.  Since  this  property  cannot  be  carried  over 
to  the  case  of  a cylindrical  geometry,  it  has  not  yet  been 
possible  to  prove  the  theorem  in  the  cylindrical  case.  In 
fact,  a dynamo  has  recently  been  proposed  [4  8_J  which  would 
violate  such  a theorem.  However  it  is  believed  that  the 
respective  calculations  are  in  error  [41]  and  that  there 
does  not  exist  a dynamo  for  a cylindrically  symmetric 
function  X if  the  radial  component  of  the  velocity  field 
vanishes. 

The  literature  of  dynamo  theory  contains  a number  of 
conjectures  of  anti-dynamo  theorems  which  have  remained 
unproven.  The  most  interesting  conjecture,  first  men- 
tioned by  Bullard  and  Gellman  [jQ , concerns  the 
intriguing  possibility  of  an  "invisible"  dynamo,  i.e.  a 
dynamo  which  generates  a purely  toroidal  magnetic  field  in 
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a planetary  core.  Since  the  toroidal  field  in  a con- 
ducting sphere  does  not  continue  into  the  insulating  space 
outside,  in  contrast  to  the  poloidal  component,  it  would 
be  difficult  to  infer  the  existence  of  such  a dynamo.  It 
is  generally  believed  that  the  dynamo  equation  neither 
admits  a purely  toroidal  magnetic  field  as  a solution  nor 
the  twin  case  of  a purely  poloidal  magnetic  field.  (The 
reference  for  a proof  mentioned  in  [15]  does  not  contain 
such  a proof,  unfortunately.)  Another  unproven  conjecture 
of  geophysical  interest  is  that  a velocity  field  confined 
to  meridional  planes  cannot  generate  a magnetic  field.  It 
appears  desirable  to  obtain  a generalization  of  the 
toroidal  theorem  which  would  include  this  and  other  cases 
and  thereby  clarify  the  topological  nature  of  velocity 
fields  required  for  dynamo  action. 

6 . Kinematic  Dynamos 

It  is  not  possible  in  this  article  to  review  the 
numerous  solutions  of  the  dynamo  equation  (2)  that  have 
been  obtained  during  the  past  twenty  years.  The  most 
extensive  and  up  to  1970  nearly  complete  review  has  been 
given  by  P.H.  Roberts  [39]  to  which  we  refer  the  interested 
reader.  Here  we  shall  restrict  our  attention  to  some 
general  features  and  open  questions  of  kinematic  dynamos. 

A common  feature  of  nearly  all  homogeneous  dynamos  is 
that  a large  scale  magnetic  field  is  generated  by  a small 
scale  velocity  field.  Even  numerical  models  that  do  not 
require  a difference  in  scales  as  part  of  the  method  of 
solution  exhibit  the  tendency  that  dynamo  action  dis- 
appears when  the  scale  of  the  motion  becomes  comparable  to 
that  of  the  large  scale  magnetic  field  (see,  for  instance, 
Gubbins  [21] ) . Although  the  reduction  of  Ohmic  dissipation 
because  of  the  large  scale  of  the  main  component  of  the 
magnetic  field  enhances  the  chances  for  dynamo  action,  the 
primary  reason  for  the  separation  of  scales  is  the  effi- 
cient two-step  interaction  with  the  velocity  field:  the 
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advection  of  the  large  scale  magnetic  field  by  the  small 
scale  velocity  field  creates  a small  scale  magnetic  field. 
Since  this  latter  field  and  the  velocity  field  have 
approximately  the  same  scale,  their  interaction  can  create 
a mean  electromotive  force  which  is  required  to  produce 
the  large  scale  field. 

A second  feature  of  efficient  dynamo  action  is  the 
property  of  helicity  of  the  velocity  field.  Helicity  is 
define u by 

« * V x K 

where  the  spatial  average  may  be  extended  over  two-  or 
three-dimensional  domains.  Helicity  is  neither  sufficient 
nor  necessary  for  dynamo  action.  A velocity  field  of  the 
form  (16)  may  have  finite  helicity,  even  though  it  cannot 
provide  dynamo  action  according  to  the  toroidal  theorem. 

On  the  other  hand  there  are  dynamo  solutions  [38,  9]  based 
on  velocity  fields  with  vanishing  helicity.  In  these 
cases,  however,  the  interaction  of  the  fluctuating  fields 
which  generates  the  mean  magnetic  field  is  of  higher 
order. 

To  illustrate  some  typical  features  of  dynamo  solu- 
tions we  describe  briefly  what  is  likely  to  be  the 
simplest  example  of  a spatially  periodic  dynamo.  We 
assume  a two-dimensional  velocity  field  of  the  form 
(20a)  ^ = V x 

where  is  the  unit  vector  in  the  z-direction  and  ip  and  w 
are  given  by 

(20b)  ip  = A sin  ax  sin  ay;  w = Cip/A 

Since  neither  z nor  t appear  explicitly  in  the  dynamo 

equation 

Jg  « exp{pt  + iyz} 

can  be  assumed.  Denoting  the  x,  y-average  by  a bar  we 
obtain  separate  equations  for  the  mean  jg  and  the  fluctuat 
ing  field  = Jg  - Jg  by  first  taking  the  x,  y-average  of 
equation  (2)  and  then  subtracting  it  from  equation  (2) , 
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(21a)  (p+  Xy2)£  = iyjj  * * $) 

(21b)  (p-XV2)^=  - iyw|  + | • 7^  + . . . 

On  the  right  hand  side  of  (21b)  we  have  neglected  terms 
involving  J^S  since  we  shall  assume 

igi « ill 

. . 2 2 
This  assumption  is  justified  in  the  limit  y <<  a if  we 

. . 2 
restrict  p to  the  order  of  magnitude  of  Xy  as  is  sug- 
gested by  equation  (21a) . The  solution  for  ^ is  readily 
obtained, 

'k  - ~h 1-  7*+--- 

2Xa 

and  its  use  in  equation  (21a)  yields 

(22)  (P+Xy2)|  = -iyjf 

This  equation  is  solved  by 

(23)  B + iB  = B, exp{pt  + iyz } , 

x y x 

(24)  p = - Xy2  + yAC/4 X 

where  B^  is  a constant  depending  only  on  the  initial  con- 
ditions. Obviously,  steady  or  growing  solutions  exist  for 

(25)  X /cxAC/ay  > 2 

This  dynamo  condition  is  written  in  such  a form  that  the 
left  hand  side  represents  the  harmonic  mean  of  the  mag- 
netic Reynolds  number  with  respect  to  the  small  length 
scale  a ^ and  the  large  scale  y The  lines  of  the  mag- 
netic field  (23)  resemble  the  steps  of  a right  hand  stair- 
case for  AC  > 0.  The  same  sense  of  turning  is  exhibited  by 
the  velocity  field  corresponding  to  a positive  helicity. 

An  illustration  is  given  in  figure  2.  The  above  dynamo 
represents  one  of  G.O.  Roberts'  [38]  dynamos  in  the  limit 
of  small  y and  we  refer  to  his  paper  for  a more  exact 
treatment.  Periodic  kinematic  dynamos  based  on  three- 
dimensional  velocity  fields  have  been  considered  by 
Childress  Q(T]  and  G.O.  Roberts  [37]  . 

The  property  exhibited  by  equation  (22)  that  the 
interaction  of  small  scale  velocity  and  magnetic  fields 
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Fig.  2:  a)  Turning  sense  of  velocity  field 

b)  Mean  magnetic  field  B as  given  by  (23) 

provides  a mean  electromotive  force  has  led  Steenbeck, 
Krause,  and  Radler  [47]  to  propose  the  concept  of  mean 
field  electrodynamics  in  which  the  dynamo  equation  (2)  is 
replaced  by 

(26)  + XV  x (V  x |)  = «V  x£ 

in  the  simplest  case.  The  quantity  a is  proportional  to 
the  helicity  of  the  small  scale  turbulent  velocity  field. 

A finite  value  of  a is  commensurate  with  the  assumption  of 
isotropy  as  long  as  the  turbulence  is  not  mirror  symmetric. 
In  general  a must  be  replaced  by  a tensor  in  order  to  take 
into  account  the  non-isotropy  of  the  small  scale  velocity 
field. 

Because  of  their  simplicity  equation  (26)  and  its 
extensions  which  include  the  effects  of  large  scale 
motions  have  been  favored  in  applications  of  dynamo  theory 
to  problems  of  geophysical  and  astrophysical  interest. 
Starting  with  the  early  work  of  Parker  Q2]  w^°  used  an 
equation  similar  to  (26)  the  applications  have  demonstrated 
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that  turbulent  dynamo  theory  is  capable  of  modeling  the 
generation  of  observed  magnetic  fields.  Because  of  the 
simplifying  assumptions  and  because  of  the  unknown  proper- 
ties of  the  turbulence,  however,  the  models  are  not  unique 
and  do  not  offer  opportunities  for  detailed  comparison 
with  observational  data. 

7.  The  magnetohydrodynamic  dynamo  problem 

The  nonlinear  magnetohydrodynamic  dynamo  problem  is 
an  unusual  bifurcation  problem  of  intrinsic  mathematical 
and  physical  interest  besides  its  important  applications. 
Because  the  bifurcating  solution  is  characterized  by  a new 
physical  quantity  it  is  governed  by  a significantly  dif- 
ferent system  of  equations  than  the  hydrodynamic  solution 
from  which  it  bifurcates.  Unfortunately,  the  same  proper- 
ties which  make  the  problem  particularly  interesting  also 
contribute  to  its  difficulties  and  at  the  present  time  the 
theoreticians  of  the  field  are  far  from  comprehending  the 
variety  of  phenomena  which  are  described  by  the  full  set 
of  coupled  equations. 

Before  we  enter  the  discussion  we  wish  to  mention  the 
two  most  urgent  reasons  for  an  understanding  of  the 
magnetohydrodynamic  problem  from  the  point  of  view  of 
applications.  A striking  property  of  stellar  and  plane- 
tary magnetic  fields  is  the  large  variation  of  magnetic 
field  strength.  In  many  cases  this  observational  quantity 
may  well  contain  the  key  for  the  understanding  of  the 
underlying  dynamo  mechanism.  While  kinematic  dynamo 
theory  can  predict  the  form  of  the  magnetic  field  if  the 
velocity  field  is  known,  it  cannot  provide  an  estimate  for 
the  amplitude.  Only  the  magnetohydrodynamic  dynamo  prob- 
lem incorporates  the  Lorentz  force,  the  balance  of  which 
with  the  other  forces  in  the  equation  of  motion  determines 
the  equilibrium  value  of  the  magnetic  energy. 

An  even  more  important  limitation  of  kinematic  dynamo 
models  is  their  lack  of  uniqueness.  In  contrast  to 
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earlier  views  which  associated  dynamo  action  with  special 
velocity  fields  only,  it  has  become  evident  as  the  number 
of  solutions  increased  during  the  past  decade  that  nearly 
all  velocity  fields  possess  dynamo  action  if  the  magnetic 
Reynolds  number  is  high  enough.  Hence  little  can  be  con- 
cluded from  the  form  of  large  scale  magnetic  fields  about 
the  velocity  fields  that  provide  the  dynamo  action.  Since 
direct  observations  of  the  relevant  velocity  fields  are 
usually  scarce  the  degeneracy  of  kinematic  models  can  only 
be  resolved  if  the  equations  of  motion  are  solved.  Even 
though  the  forces  are  not  known  either  in  many  cases,  the 
choice  of  physically  reasonable  forces  is  much  more 
restricted  and  solutions  of  the  full  magnetohydrodynamic 
dynamo  problem  are  likely  to  provide  additional  con- 
straints such  that  a unique  determination  of  the  dynamo 
process  becomes  possible. 

A typical  difficulty  in  solving  the  magnetohydro- 
dynamic dynamo  problem  is  encountered  already  at  the  level 
of  the  purely  hydrodynamic  solution.  There  does  not  seem 
to  exist  an  exact  analytical  solution  of  the  Navier-Stokes 
equations  for  physically  realizable  forces  which  provides 
dynamo  action.  Indeed,  nearly  all  known  exact  solutions 
can  be  written  in  the  form  (16)  which  is  barred  from 
dynamo  action  by  the  toroidal  theorem.  Thus  the  solutions 
of  the  magnetohydrodynamic  dynamo  problem  depend  on  hydro- 
dynamic  approximations  in  addition  to  those  connected  with 
the  magnetic  part. 

Thermal  convection  in  rotating  or  non-rotating  layers 
heated  from  below  has  been  used  in  all  magnetohydro- 
dynamic models  since  it  offers  convenient  small  scale 
motions  in  addition  to  its  relevance  to  geophysical  and 
astrophysical  applications.  The  fact  that  convection 
itself  represents  a bifurcating  solution  from  a basic 
static  state  facilitates  the  mathematical  analysis.  The 
equations  of  motion  for  the  velocity  vector  and  the 
heat  equation  for  the  deviation  6 of  the  temperature  from 
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the  static  solution  Tq  are  given  by 

(27a)  ^ + Y*VY=  - Vti  - Bg6  + vV2^  + ^(V  x £)  x £ 

(27b)  V • ^ = 0 

(27c)  ^e-0+y*V0=  ~y*VTo+kV20 

where  3,  v and  k denote  the  coefficient  of  expansion,  the 
kinematic  viscosity,  and  the  thermal  diffusivity,  respec- 
tively. g is  the  gravity  vector.  The  Boussinesq  approxi- 
mation has  been  used  which  takes  into  account  the 
temperature  dependence  of  the  density  in  the  gravity  term 
only.  All  terms  that  can  be  written  as  gradients  are 
included  in  Vir  since  they  do  not  affect  the  dynamics  of 
the  system.  In  the  case  of  a rotating  system  the  Coriolis 
force  term  - 2jlx  ^ must  be  added  on  the  right  hand  side  of 
equation  (27a) . 

The  only  systematic  analytical  method  for  the 
investigation  of  the  post-bifurcation  behavior  of  dynamo 
solutions  is  the  expansion  in  powers  of  the  amplitude  of 
the  magnetic  field.  Since  the  magnetic  field  enters  the 
equations  of  motion  (27a)  quadratically , the  velocity 
field  can  be  expressed  in  an  expansion  of  even  powers 
while  the  magnetic  field  is  given  by  an  expansion  in  odd 
powers.  By  the  same  argument  (^ , - Jg)  is  a solution  of 
the  magnetohydrodynamic  dynamo  problem  for  any  solution 
(^ , Jg),  which  is  the  basic  reason  for  the  fact  that  both 
polarities  of  the  earth's  magnetic  field  have  occurred 
about  equally  in  the  geologic  history  of  the  earth. 

The  analysis  starts  with  the  hydrodynamic  problem 
described  by  equations  (27)  with  JjJ  = 0 and  appropriate 
boundary  conditions.  After  the  velocity  field  has  been 
determined  the  kinematic  dynamo  problem  (2)  is  solved  in 
the  second  step.  Among  the  solutions  for  the  magnetic 
field  the  fastest  growing  mode  is  assumed  to  be  the  physi- 
cally realized  one.  The  Lorentz  force  exerted  by  this  mode 
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is  introduced  as  a perturbation  into  equation  (27a)  and 
the  resulting  modification  of  the  velocity  field  is  calcu- 
lated in  the  third  step.  In  the  fourth  step  the  modifica- 
tion of  the  velocity  field  is  taken  into  account  in  the 
dynamo  equation  (2).  According  to  Lenz'  rule  it  can  be 
expected  that  the  modification  leads  to  a reduction  of  the 
growth  rate  of  the  magnetic  field.  Since  the  velocity 
modification  grows  proportional  to  | Jg  | the  growth  rate 
decreases  monotonically  until  it  vanishes,  at  which  point 
the  equilibrium  amplitude  of  the  magnetic  field  is 
reached.  This  behavior  characterizes  steady  dynamos  as 
well  as  those  producing  an  oscillatory  magnetic  field,  in 
which  case  the  above  conclusions  apply  to  the  mean 
amplitude . 

While  the  four-step  procedure  is  generally  applicable, 
it  has  been  applied  only  to  the  case  of  convection.  In 
accomplishing  the  first  step  it  is  not  a trivial  matter  to 
avoid  motions  of  the  form  (16)  since  in  general  the  most 
simple  kind  of  convection  in  the  form  of  rolls  is  physi- 
cally realized  [43,  2£J  . For  this  reason  convection  rolls 
with  a superimposed  mean  shear  flow  were  used  in  an  early 
magnetohydrodynamic  dynamo  []9£]  . The  equilibrium  value  of 
the  magnetic  energy  in  this  case  is  of  the  order  of  the 
change  of  the  gravitational  energy  of  the  layer  owing  to 
convection  multiplied  by  the  ratio  of  magnetic  to  viscous 
diffusivity.  Soward's  dynamo  model  £45]  based  on  convec- 
tion in  a rotating  layer  requires  a three-dimensional  con- 
vection pattern.  Although  steady  three-dimensional 
convection  solutions  are  unstable  £24]  , Soward  finds  that 
there  exist  time-dependent  three-dimensional  magnetic  solu- 
tions which  appear  to  be  stable.  While  Soward  investigated 
the  case  of  asymptotic  high  Taylor  numbers  for  which  the 
horizontal  scale  of  motion  becomes  small  compared  to  the 
vertical  scale,  Iooss  £Sf]  has  more  recently  analyzed  the 
case  of  moderate  Taylor  numbers.  Using  functional 
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analytic  methods  he  gives  a comprehensive  discussion  of 
the  various  possibilities  of  bifurcation.  Of  particular 
geophysical  relevance  is  the  case  of  convection  with 
gravity  and  rotation  vectors  at  right  angles.  Convection 
columns  aligned  with  the  axis  of  rotation  are  realized  in 
this  case.  They  are  representative  of  motions  in  the 
earth's  core  if  the  Lorentz  force  is  small  compared  to  the 
Coriolis  force.  The  dynamo  model  Q.1]  based  on  this  type 
of  convection  exhibits  a steady  magnetic  field  in  the  form 
of  a dipole  aligned  with  the  axis  of  rotation,  which 
agrees  in  first  approximation  with  the  earth's  magnetic 
field.  Since  the  model  shares  with  the  other  models  the 
restriction  to  weak  magnetic  fields,  a prediction  for  the 
amplitude  of  the  earth's  magnetic  field  cannot 
be  made. 

Because  Lenz'  rule  does  not  have  the  character  of  a 
rigorous  law  in  magnetohydrodynamics  it  is  not  surprising 
that  the  equilibrium  amplitude  of  the  magnetic  field  is 
not  always  reached  monotonously.  Soward  [45]  finds  that 
for  sufficiently  vigorous  convection  no  equilibrium  value 
can  be  obtained  within  the  framework  of  his  model.  The 
property  that  the  critical  value  for  dynamo  action 
decreases  with  increasing  magnetic  field  strength  corres- 
ponding to  unstable  bifurcation  must  be  regarded  as  the 
rule  rather  than  the  exception  in  rotating  systems  since 
the  Lorentz  force  tends  to  counteract  the  stabilizing 
influence  of  the  Coriolis  force.  An  example  of  this 
phenomenon  has  been  studied  recently  Q.3]  in  an  extension 
of  the  model  QQ  . In  a particular  case  the  nonlinear 
dynamo  equation  for  the  dimensionless  magnetic  field  jjj  can 
be  expressed  in  a simple  form  in  analogy  to  the  kinematic 
equation  (22) , 

<28a)  a^-Sxf[c/  li?l2d2-z(6  + o/1  |||2dz)]+d4 
where  ^ represents  the  unit  vector  in  the  y-direction  and 
d is  the  constant  of  integration.  The  boundary  conditions 
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are 

(28b)  Jg(0)  = 0;  ^ • Jg(l)  = 0 

Solutions  of  this  nonlinear  boundary  value  problem  exist 
at  much  lower  values  of  6 when  f \g\  dz  is  finite  than 
when  it  vanishes  as  in  the  linear  limit  of  problem  (28) . 

Various  aspects  of  magnetohydrodynamic  dynamo  theory 
have  been  considered  on  the  basis  of  turbulent  dynamo 
action.  Moffatt  [30,  31]  and  Soward  [46j  considered 
dynamo  action  by  random  inertial  waves  in  a rotating  sys- 
tem. Malkus  and  Proctor  [29]  investigated  the  effect  of 
a large  scale  velocity  field  induced  by  the  Lorentz  forces 
of  a turbulent  dynamo.  Frisch  et  al.  [20] , and  Pouquet  et 
al.  [35]  have  emphasized  the  interaction  between  the  heli- 
cities  of  magnetic  and  velocity  fields  as  a dominant  fea- 
ture of  turbulent  dynamo  action.  The  main  difficulty  of 
these  theories  is  that  the  concept  of  an  isotropic  turbu- 
lence with  finite  helicity  appears  to  be  too  idealized  to 
be  directly  applicable  to  physically  realized  dynamos. 

The  variety  of  effects  exhibited  in  the  idealized  case  of 
isotropic  magnetic  turbulence  gives  some  indication,  how- 
ever, of  the  much  larger  variety  of  phenomena  to  be 
expected  in  more  realistic  cases  of  the  magnetohydro- 
dynamic problem. 

8 . Concluding  remarks 

It  is  evident  from  the  discussion  in  the  preceding 
section  that  answers  to  some  of  the  key  questions  of  non- 
linear dynamo  theory  such  as  the  question  of  the  equi- 
librium amplitude  of  the  magnetic  field  are  still  elusive 
except  in  the  small  amplitude  limit.  This  is  not  sur- 
prising in  view  of  the  fact  that  similar  questions  in  the 
case  of  hydrodynamic  turbulence  cannot  be  answered  on  the 
basis  of  present  theories.  There  is  hope,  however,  that 
in  special  cases,  as  for  example,  rapidly  rotating  sys- 
tems, the  fully  nonlinear  dynamo  may  be  more  readily 
accessible  to  theoretical  analysis. 


MATHEMATICAL  PROBLEMS  OF  DYNAMO  THEORY 


197 


From  geophysical  and  astrophysical  observations  it  is 
evident  that  there  is  a close  relationship  between  rota- 
tion and  the  dynamo  process.  It  is  generally  believed 
that  a dynamic  reason  for  the  generation  of  a magnetic 
field  is  that  it  tends  to  alleviate  the  constraint  of 
rotation  on  fluid  motion.  Indeed,  even  the  simple 
example  of  convection  in  a Ic.yer  heated  from  below  shows 
that  while  magnetic  field  and  rotation  inhibit  convection 
when  occurring  separately  their  inhibitory  effect  is 
strongly  reduced  when  they  occur  together  jj.4]  . Roberts 
and  Stewartson  [jiff]  have  recently  studied  nonlinear 
aspects  of  this  interaction  and  derived  arguments  for  a 
balance  between  Coriolis  force  and  Lorentz  force  in  the 
earth's  core.  It  can  be  argued  that  only  the  ageo- 
strophic  component  of  the  Coriolis  force  requires 
balancing  by  the  Lorentz  force  and  that  for  this  reason 
the  strength  of  the  magnetic  field  in  the  earth's  core  is 
much  lower  than  commonly  assumed  jj.fl  . The  balance  of 
the  ageostrophic  component  of  the  Coriolis  force  in  the 
case  of  convection  appears  to  yield  a relationship  for 
the  amplitude  of  the  magnetic  field  which  compares  well 
with  the  values  of  the  three  planetary  magnetic  fields 
for  which  a dynamo  origin  can  be  assumed  and  detailed 
measurements  are  available  jj.2] . Hopefully,  these  specu- 
lations can  be  replaced  by  rigorous  theoretical  arguments 
in  the  not  too  distant  future. 

Appendix:  Boundary  conditions 

The  boundary  conditions  at  a stationary  surface  which 
separates  regions  of  different  magnetic  diffusivity  X are 
usually  given  in  terms  of  the  magnetic  field  and  the 
electric  field  £ 

(Al)  DO  = 0 

(A2)  [q  * Jf]  = o 

where  q is  the  normal  unit  vector  of  the  surface  and  the 
brackets  indicate  the  difference  between  the  values  of  the 
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enclosed  quantity  on  both  sides  of  the  interface.  It  has 
been  assumed  that  magnetic  permeability  does  not  change 
across  the  interface. 

The  condition  (A2)  has  two  disadvantages.  First,  it 
is  not  independent  of  condition  (Al) ; secondly,  it  is  for- 
mulated in  terms  of  the  electric  field  Jg  which  has  been 
eliminated  from  the  dynamo  equation  (1).  In  order  to 
obtain  a more  suitable  boundary  condition  we  assume  for 
simplicity  the  case  of  a plane  interface.  Replacing  by 
the  constant  normal  vector  £ of  the  plane  we  find  that  (A2) 
is  equivalent  to 

(A3)  [v->$x£]=o,  Dfc  • Vx  (jgxjgg  = 0 

except  in  the  case  of  a homogeneous  field  Jg,  which  is  not 
of  much  interest  in  dynamo  theory  and  can  be  considered 
separately  if  necessary.  Since  V • £ x jg  = - £ • V x jg  the 
first  of  conditions  (A3)  is  included  in  condition  (Al) 
because  of  Maxwell's  equation  3/3t  Jg  = - V x jg.  The  second 
condition  (A3)  can  be  expressed  in  terms  of  Jg  by  using 
Ohm's  law, 

(A4)  = 0 

In  the  derivation  of  (A4)  it  has  been  used  that  ^ 
vanishes  at  the  interface.  Conditions  (Al)  and  (A4)  rep- 
resent necessary  and  sufficient  conditions  at  a plane 
interface.  For  the  spherical  surface  a condition  similar 
to  (A4)  can  easily  be  derived.  In  special  cases  where  the 
outside  medium  is  insulating  (Aq/A^ ->•«>)  or  infinitely  con- 
ducting ( “*■  0 ) simplifications  of  the  conditions  can 
be  achieved. 
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Bifurcation  Theory  and  Permanent 
Waves 

E.  Ze  idler 


The  purpose  of  this  paper  is  to  give  an  overview  of  some  aspects 
of  existence  proofs  for  permanent  waves  that  have  recently  been  pub- 
lished by  the  author  using  a uniform  method  (cf.  [34]).  The  subjects 
treated  here  include; 

- relationships  between  sinusoidal  capillary-gravity  waves  and 
cnoidal  capillary -gravity  waves; 

- capillary-gravity  waves  of  two  superposed  fluids; 

- capillary-gravity  waves  with  general  vorticity  distributions; 

- tidal  waves  and  capillary-gravity  waves  in  circular  channels; 

- flow  around  a gas  bubble  with  a prescribed  parallel  and  circu- 
latory flow  at  infinity,  the  force  of  gravity  being  taken  into  con- 
sideration. 

The  last  section  of  this  article  deals  with  a number  of  open  problems. 

The  theory  of  permanent  waves  has  a long  history.  Airy, 
Boussinesq,  Kelvin,  Korteweg,  de  Vries,  Rayleigh,  and  Stokes  were 
among  the  nineteenth-century  investigators  who  concerned  themselves 
with  this  theory.  The  first  rigorous  proofs  of  the  existence  of  a perman- 
ent wave  motion  (infinite  -depth  gravity  waves)  were  independently  given 
by  Nekrassov  (1921)  and  Levi-Civita  (1924).  Investigators  who  there- 
after occupied  themselves  with  this  subject  include  Struik,  Kotschin, 
Lichtenstein,  Dubrell-Jacotin,  Lavrentyev,  Friedrichs,  Hyers,  and 
Krasnoselski.  In  the  past  twenty  years,  numerous  other  mathematicians 
also  concerned  themselves  with  the  subject  (cf.  [31],  [34]). 

Our  own  studies  followed  the  lines  set  out  by  the  work  of 
H.  Beckert  ([6,7,8,  9]).  The  objective  of  these  studies  was  to 
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- simplify  the  constructive  existence  proofs  given  by  Levi- 
Civita,  Struik,  Kotschin,  and  Dubreil-Jacotin; 

- replace  topological  existence  proofs  by  constructive  existence 
proofs; 

- produce  existence  proofs  for  problems  that  have  not  yet  been 
dealt  with; 

* 

- study  correlations  between  the  various  problems. 

Our  treatment  of  the  problems  is  by  the  steps  summarized  below; 

- The  plane  hydrodynamic  problem  is  expressed  as  a nonlinear 
free  boundary  value  problem  for  the  Laplace  eguation  or  more 
general  nonlinear  elliptic  differential  equations  (in  the  case  of 
vorticity  distributions). 

- A conformal  or  quasiconformal  mapping  gives  a nonlinear  bound- 
ary value  problem  with  an  eigenvalue  parameter  for  the  Cauchy- 
Riemann  differential  equations  or  for  more  general  integro-dlf- 
ferential  equations  (in  the  case  of  vorticity  distributions).  The 
equations  are  defined  in  the  unit  circle  for  infinite  depth 
channels,  in  an  annulus  for  the  finite  depth  case,  or  in  nonlin- 
early  coupled  unit  circles  or  annuli  for  superposed  fluids. 

- These  problems  may  be  studied  with  the  aid  of  bifurcation  theory. 
Only  a few  basic  principles  will  be  outlined  below.  Information  about 
further  contributions  to  these  problems  can  be  found  in  the  references 
(see  especially  [31],  [34])  at  the  end  of  this  paper.  In  particular  see 
[34]  for  more  details  on  the  mathematical  formulation  of  the  problems 
studied  below. 

§ 1.  Capillary-Gravity  Waves 

The  waves  considered  here  are  plane  waves  of  wavelength  X.  , 
under  the  action  of  a gravitational  accelleration  g and  surface  tension 
p which  propagate  with  a constant  velocity  c in  a channel  of  depth  h . 
The  shape  of  the  surface  y = yQ(x)  is  fixed  in  an  x,  y coordinate  sys- 
tem moving  with  the  wave. 
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Figure  1 


To  solve  this  free  boundary  value  problem,  we  seek  y = Yq(ii) * * * * * * * x)  an^  c = 
c(X)  . We  assume  that  the  fluid  is  of  density  p and  is  lnvlscld,  homog 
eneous,  and  incompressible,  and  that  the  motion  is  lrrotational.  Stand- 
ard approximations  [34]  to  the  partial  differential  equations  which  de- 
scribe this  problem  show  that  there  are  two  types  of  waves  for  X < » , 
namely 

i)  sinusoidal  waves  (Kelvin,  1871) 


(1) 


yow  - n 


sin 


2ttx 


.gX  2irft  2irh 

(■2—  + ) tanh  — — 

'2ir  Xp  ' X 


ii)  cnoidal  waves  (Korteweg,  de  Vries,  1895) 

(2)  yQ(x)  = 3aZh(l  - )k2  • cnZ(^y  , k) 

gph 

cZ  = gh,  X = 4K(k)  h/3a,  0 < k < 1 , 

where  cn(. ) is  Jacobi's  elliptic  function  and  K(k)  is  the  complete 

elliptic  integral  of  the  first  kind. 

In  (1)  and  (2)  s and  a are  small  parameters  so  h « X always 

holds  in  li).  This  is  called  shallow  water  theory.  What  is  also  impor- 

tant is  the  existence  in  ii)  of  a critical  depth 
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(3)  hQ  = v/  3 p/g»i  . 

For  h < hQ  (and  It  is  only  in  this  particular  case  that  the  author  suc- 
ceeded in  giving  an  existence  proof),  yQ(x)  has  a negative  sign  giving 
us  a negative  cnoidal  wave  as  compared  with  the  case  of  pure  gravity 
waves  where  the  surface  tension  p = 0 . 

Exact  solutions  of  the  free  boundary  problem  which  are  power 
series  expansions  with  respect  to  the  small  parameters  s and  a can 
be  determined.  We  pose  the  following  questions; 

- How  can  these  exact  solutions  be  found  and  their  Taylor  co- 
efficients be  calculated  ? 

- How  are  i)  and  ii)  related? 

- Can  one  let  h -*  <*  and  p -►  0,  respectively,  in  the  exact  solu- 
tions in  order  to  obtain,  in  i),  infinite  depth  capillary-gravity 
waves  ( h = °°)  and  finite  depth  gravity  waves  (p  = 0)? 

It  will  be  seen  that  the  answer  to  these  questions  involves  an  interesting 
bifurcation  effect  occurring  because  our  problem  leads  to  a nonlinear 
boundary  value  problem  for  the  annulus  {t,:  0<q<  |t,|  < 1}.  Here  q 
is  a parameter  in  terms  of  which  we  will  determine  the  allowable  range 
of  values  of  h . 

If  0 < q < 1,  we  get  sinusoidal  capillary-gravity  waves  resulting 
from  bifurcation  from  simple  eigenvalues; 
as  q ■*  0,  we  get  sinusoidal  infinite  depth  capillary-gravity 
waves; 

and  as  q-*l,  cnoidal  waves  are  obtained  in  an  Interesting  case  of 
asymptotic  bifurcation. 

Mathematically,  the  problem  is  to  find  an  analytic  function 
W = <p  + 14>  in  the  region  of  flow  ratifying  the  boundary  conditions; 
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(4)  C gpyQ(x)  + ^-nl  W'(z)  I2  + pQ  - (3  ^ = constant-^ 


= o 


«4>  = = constant  at  y = -h 


) 


at  y = yQ(x) 


(Here  pQ  = external  pressure,  = arctan  y^(x),  t = arclength  along 
y = yQ(x),  and  z = x + ty).  Making  the  ansatz 

W = c e’*u,  w = $ + It  , 

after  a conformal  mapping  [34],  the  following  boundary  value  problem  re- 
sults for  the  annulus 


Kq  = {?,  = P eia*  0 < q < p < 1} 

(where  p,<r  are  polar  coordinates). 

Required  are  constants  b,  p > 0,  and  an  analytic  function  in  Kq,  u - 
$ + iT,  satisfying  the  following  boundary  conditions 

2 o- 

(5)  -b  ^ = j (e  T + d( «,  t)  e'T)  ~ -P  ZT  f e‘T  sin  tfdo- 

c c o 

s F(«,  t)  at  p = 1 
•0  = 0 at  p = a 


and  symmetry  conditions 


tf(p,  -o')  = -fl(p,<r),  r(p,  -<r)  = T(p,a) 

TT 

/ T(q,  a)d(T  = 0 , 

-IT 


where 
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2it(3  _ gX. 

b = — "y  , p = y 

pXc  2 ire 


C 1 r1'  -T(q,<r) 

’ c = 2 7 J 6 da 


, 2irh  — ^0  , . 

q = exp(-—  ),  h = = mean  depth  . 

K C 


The  functional  d(tf,  t)  in  (5)  is  chosen  so  that 

TT 

/ F(0,  T)d<r  = 0 . 

-IT 

In  order  to  be  able  to  study  the  relationship  between  sinusoidal 

and  cnoidal  waves,  this  problem  is  transformed  into  an  operator  equation 

in  the  Banach  space  C’  0 < a < 1,  of  Holder  continuous  functions  on 

a 

the  boundary  of  the  unit  circle  p = 1 with  the  symmetry  condition  0(1,  -o-) 
= - 0(1,  <r)  . 

For  each  function  ^(1,  <r)  t C"  there  is  exactly  one  analytic  func- 

o 

tion  w = 0 + It,  in  K , with 

’ q’ 


0(q,<r)  = 0 , / T(q,o-)d<r  = 0 


We  set 


and 


t(1,ct)  = -X  0(1,  <r) 


. TT  <T 

Tf(«r)  = f(«r)  - j-  / f(<r)d<r,  Jf(<r)  ■ T / Tf(«r)d«r  . 

-tt  0 

From  (5)  one  gets  the  operator  equation 


(6) 


-b  0 = JF(0,  t = -X  0),  0 « C‘  . 

’ q ’ a 


This  operator  equation  has  the  following  form 

00 

(7) 


bx  = T,  (\(q)x  + P M.  (q)xk),  x € Y , 


k=l 
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where  b,  p > 0,  C < q < qj  < 1 are  real  parameters;  Nj(q),  Mj(q)  are 
linear,  completely  continuous  operators  In  the  Banach  space  Y,  contin- 
uous functions  of  qt  [0,q.]  in  the  norm  topology  of  Y;  the  maps 

* 

q -*  N (q)x  , q -*  M (q)x  k are  continuous  for  all  q e [0,q.  ] = I , for 
k k k k ^ 

||x||<r,  k = 2,  3,  . . . ; Nk<q)x,  M^(q)x  are  k -linear  with  respect 

to  x;  and 


(7*) 


S 

k=l 


sup 


qt  I 


lNk(q) 


sup 


k = l 


q<  I 


|Mk(q) 


< 00 


Theorem  1.  Suppose 

(8)  For  all  qt  I,  the  linearized  problem 

bx  = Nj(q)x  + pMj(q)x 

has  a simple  eigensolution  x = x^,  P = Pq  which  is  continuously 
dependent  upon  q t I ; 

$ 

(9)  There  Is  a linear,  continuous  functional  yt  Y , so  that  the 
following  holds  for  all  q < I ; 


bx  = Nj(q)x  + pqM1(q)x  + PqX 

Implies  x = 0 where  P^x  = y(x)x^,  y(x^)  = 1 ; 

(10)  Y(RqM1(q)Xq)  * 0,  V q « I . 

(Here  s (b  • id  - N^(q)  - PqMj(q)  - Pq)  1 exists  according  to  the 
Riesz-Schauder  theory.  For  N^(q)  = 0,  (10)  follows  from  (8),  (9)  .) 

Then  there  are  numbers  rQ,  eQ,  sQ  > 0,  so  that  for  given  s with 
0 < |s|  < sQ  (7)  has  exactly  one  solution  (x,  p)  with 

11*1  < r0,  y(*)  = s,  I p - pq  I < c0  • 


for  each  q * I . This  solution  Is  analytic  with  respect  to  the  small 
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parameter  s : 

“ k °°  k 

(11)  x = sxn  + Z s x p = P0  + E ek  8 » = °«  V k > 2 , 

q k=2  q k=l 

where  x,  p,  x , and  e,  are  continuous  functions  of  q . 

K K 

The  proof,  which  uses  the  method  of  Lyapunov-Schmidt  is  given  in 
[34].  Theorem  1 is  closely  related  to  a general  theorem  by  Crandall  and 
Rabinowitz  [11].  The  relationship  to  simple  algebraic  eigenvalues  is  dis- 
cussed in  [38]. 

Introducing  the  Fourier  coefficients 

ak^  5 2tT  / t(1,  cr)  cos  k<r  do-  , 


bk<  ■Q)  s J -6(1,  <r)  sin  k<r  dcr  , 

-TT 

it  is  easy  to  verify  the  assumptions  (8),  (9)  and  (10)  for  a variety  of 
classes  of  permanent  waves  because  the  effect  of  all  of  the  above  op- 
erators on  the  Fourier  coefficients  is  explicitly  known.  One  then  ob- 
tains existence  proofs  for  sinusoidal  waves  ([34]). 

The  following  is  a description  of  a method  for  the  explicit  con- 
struction of  (11).  Given  that  p = p^  + e , (7)  is  equivalent  to 

°°  k k 

(12)  x = sx  + eR  Mx+R  Y N (q)x  + (p  + e)M  (q)x  , 

M H A M ]£_2  K M K 

and  (11),  (12)  imply  that 

s2x2  = s2(£iRqMiXq  + H(Xq))  . 

Then  y(x2)  = 0 9lves 

= -y(H(xq))/y(RqM1xq)  . 
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This  permits  to  be  determined.  The  coefficients  x , e etc. 
are  obtained  in  the  same  manner. 

Application  of  this  method  to  our  problem  (6)  yields 


00 

(13)  -6(1, cr)  = s • sinner  + V ^ , (<r)sk 

k=Z 


where 


00 

are  trigonometric  polynomials,  p = p + ^ 

Q k=l 


k 

s , 


and  pq  , 


^k’  ek  may  calculated  explicitly. 

In  [38]  a number  of  other  methods  of  calculating  (11)  without  solv- 
ing the  bifurcation  equation  are  given. 

The  case  of  infinite  depth  capillary-gravity  waves  gives  a problem 
which  is  analogous  to  (5)  for  which  becomes  the  unit  circle,  i.  e. 

0 < z < 1 . Using  the  continuous  dependence  of  Theorem  1 in  q,  it  is 
possible  to  show 


(capillary-gravity  waves  of  depth  h < <*>) 

-*  (capillary-gravity  waves  of  depth  h = °°) 

as  h -*  oo  in  the  norm  of  C”  . 

a 

The  study  of  the  limit  process  b -*  0 (surface  tension  p -*  0)  is 
far  more  difficult  because  the  structure  of  the  operator  equations  (6),  (7) 
then  changes.  However,  it  is  possible  to  show 

ek  (caPlllarY-gravity  waves  p * 0) 

-*  $k»  e)<  (gravity  waves  p - 0) 

as  p -*  0 in  the  norm  of  C"  . Only  the  coefficients  in  (13)  converge  in  this 

Cl 

case. 

The  existence  proof  for  cnoidal  waves  (q  -*  1)  has  the  following  under- 
lying idea.  In  oo 

(7)  k>x  =Yj  (Nk(q)xk  + PMk(q)xk)  , 


for  large  p we  set 
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q = exp(-l/p),  b = y/p,  P , 

with  y = constant  and 


a = K(k)/irp  . 


It  is  possible  to  show  that 

Sp  2 (b  • id  - Nj(q)  - pM^q))'1 

exists  for  y » 1/3  . Then  (7)  implies 

00 

<14)  X = Sp(E  Nk(q)xk  +PMk(q)xk) 

with  the  asymptotic  structure 


(15) 


2 v -k„  m 

x = p L p skmx  • 

k>l.  m>2 


Therefore  making  the  ansatz 


-1  -2 
X = p U + p V , 


one  obtains  the  following  nonlinear  equation  for  u : 


(16)  u = Sl2uZ  • 

If  u is  known,  then  the  equation  obtained  for  v is 

v = A(p_1,  v)  . 

This  equation  may  be  solved  using  Banach's  fixed -point  theorem. 

For  this  procedure  to  succeed  it  is  essential  that  the  nonlinear 
equation  (16)  may  be  solved  and  that  asymptotic  expansions  do  exist. 
Demonstration  of  the  existence  of  such  asymptotic  expansions  is 
rather  difficult  in  the  case  of  cnoidal  waves,  especially  if  consideration 
is  given  to  the  surface  tension.  This  is  due  chiefly  to  the  fact  that  the 
operators  involved  are  closely  related  to  singular  Integral  operators. 
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Moreover  an  existence  proof  is  complicated  by  (14)  having  a much  more 
complex  structure  than  (15)  (cf.  [35]). 

Littman’s  proof  of  the  existence  of  cnoidal  waves  ([20]),  which 
does  not  take  the  surface  tension  into  account  results  in  a boundary  value 
problem  for  the.  strip  0 < ijj  < 1 . The  small  parameter  a in  (2)  results 
from  the  use  of  Friedrichs'  "stretching  of  the  independent  variable" 
method  ([13]). 


§ Capillary -Gravity  Waves  of  Two  Superposed  Fluids 

Two  fluids  are  considered  here,  the  conditions  being  identical  to 
those  referred  to  in  §1  above. 

Propagating  at  the  interface 
is  a permanent  wave,  propa- 
gation being  influenced  by 
both  the  gravitational  force 
and  the  surface  tension. 

After  conformal  mapping  two 
boundary  value  problems 
(which  are  analogous  to  (5)) 
are  obtained  for  two  reguired  analytic  functions 


Figure  2 


")«)» = YV + 1 VV’ 


V 


ia 

Pje 


J = 1,2 


in  two  annuli  in  the  t,j -plane  and  in  the  "Plans*  What  is  peculiar 
about  this  problem  is  that  it  is  necessary  to  determine,  not  only  w (I,  ) , 
but  also  the  unknown  relationship  between  the  angular  coordinates, 


namely,  <r2  = g(o-j)  . 

The  method  of  proof  consists  of  reducing  the  problem  to  an  operator 
equation  along  Pj  = 1 for  (6j(l,  tr j),  g(o-j))  and  involving  Theorem  1 . To 
verify  (7')  it  is  necessary  to  use  sharp  estimates  for  singular  integral 
operators,  which  are  given  in  [34].  Also,  the  tools  provided  in  [34]  en- 
able other  such  coupling  problems  to  be  dealt  with  effectively. 

In  1928,  Kotschin  ([16]),  with  the  aid  of  a complicated  method  of 
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majorants  was  able  to  demonstrate  the  existence  of  gravity  waves  of  the 
form  shown  in  Figure  2 above.  The  method  of  proving  the  existence  of 
gravity  and  capillary-gravity  waves  given  in  [34]  is  much  simpler. 

The  existence  of  capillary-gravity  waves  is  quite  interesting  from 
a physical  point  of  view.  Whereas  in  the  case  of  gravity  waves  the  up- 
per fluid  must  always  be  lighter  than  the  lower  fluid,  the  reverse  is  also 
possible  in  the  case  of  capillary-gravity  waves. 

§ 3.  Tidal  Waves 

We  will  consider  here  a plane  dynamic  tidal  model  in  an  x - y co- 
ordinate system  in  which  the  Moon  is  nonrevolving,  the  Earth  is  rotating 
at  an  angular  velocity  to,  and  the  curve  r = r (o)  of  the  ocean  surface 
is  fixed  in  an  equatorial  channel  (Figure  3;  r,  a are  polar  coordinates). 


Figure  3 

In  order  to  obtain  a stable  model,  it  is  necessary  to  use  the  square 
approximation  of  the  Moon's  gravitational  potential.  This  involves 
using  an  anti-Moon  at  x = -x^  and  halving  the  effect  produced 
by  the  two  moons.  Also,  assume  the  ocean  fluid  be  inviscid,  incompres- 
sible, and  homogeneous,  and  let  the  motion  be  an  irrotational  motion  or 
have  a general  vorticity  distribution. 
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Of  greatest  interest  is  the  surface  curve  r = r(a)  . The  exact  proof 
of  existence  shows  that  r(o)  has  a maximum  at  P and  a minimum  at  Q 
(Figure  3).  This  is  in  agreement  with  what  is  actually  observed  in  nature 
where  the  tidal  wave  does  not  arrive  until  the  Moon  has  passed  through 
the  zenith.  While  this  effect  is  qualitatively  correctly  obtained  in  our 
model,  the  amplitudes  of  our  tidal  waves  are  too  small  as  compared  with 
actual  measurements.  However,  this  is  not  surprising  since  our 
tidal  model  (which  was  originally  proposed  by  H.  Beckert  [9])  represents 
an  exact  formulation  of  Airy's  channel  theory  of  tides  in  straight-line 
channels,  where,  to  a first  approximation,  the  effects  referred  to  above 
can  also  be  seen  to  occur. 

H.  Beckert  ([9])  was  able  to  give  a topological  existence  proof  for 
irrotational  tidal  waves.  A constructive  existence  proof  will  be  outlined 
here  for  general  vorticity  distributions  ([34],  [5]).  In  this  context  we 
will  also  study  the  much  more  complex  case  of  natural  oscillations  oc- 
curring without  a perturbing  body  (i.  e.  a moon)  being  present  (i.  e.  tidal 
waves  on  planets  having  no  moons  revolving  around  them).  This  involves 
the  following  remarkable  alternative;  There  are  critical  angular  veloci- 
ties such  that; 

- for  to  4 ojj  there  exist  tidal  waves  in  the  presence  of  perturbing 
bodies; 

- in  the  neighborhood  of  there  exist  natural  oscillations  in  the 
absence  of  perturbing  bodies. 

To  explain  the  existence  of  spring  and  neap  tides,  it  is  necessary  to  al- 
so consider  the  effect  of  the  Sun  located  at  the  point  (x  , 0)  for  a 
spring  tide  or  (0,y  ) for  a neap  tide. 

u 

For  the  velocity  vector  (u,  v)  we  have  the  differential  equations 

u + v =0,  u - v = cf(  Jj)/R 
x y ’ y x T,/ 

v = and  the  boundary  con- 


for  the  stream  function  ijj  with  u = v|j  , 
ditions 
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f f1*  r + kUM  + ^ US  + P0  + 2 *uZ  + = const- 

\ + = 0 at  r = r(  a) 

i|j  = -4>0  = const,  at  r = R 

where  and  are  respectively  the  Moon's  and  Sun's  potentials, 

pQ  is  the  pressure,  p.  is  the  density,  R is  the  radius  of  the  Earth,  f 
is  the  given  vorticity  distribution,  and  k is  a constant. 

After  a quasiconforma!  mapping  the  following  problem  for  the  annu- 
lus 

Kq  s ^ ~ pei<7:  0 < q < P < 1}  • 

is  obtained. 

For  a given  vorticity  distribution  f e C [q,  1],  0 < y < 1,  we  find 

two  functions  with  Holder  continuous  derivatives 

»,T.  cW(iCq)  , 

which  satisfy  the  integro-differential  equations 


0 

P 


— = 0 («,  T,  T ) 


T + 

P 


cr 

P 


+ 2t 


= 0 («,T,«  ) 

L.  O’ 


r 2 

exp  tq(p)  = J p(f(p)/q  )dp  + 1 > 0 , 

q 


the  boundary  conditions 
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Tt  - \ T f tfd<r  = (a,^  ± a_)(cos  2cr  + 0.  ( -6,  t)) 

J q Mo  1 

+ (2  + \)  0 (■©,  t)  at  p = 1 

= 0 , at  p = q 


as  well  as  the  symmetry  conditions 

TT 

tf(p,  -<r)  = -0(p,(r),  t(P, -tr)  = T(p,<r),  f T(q,o-)dcr  = 0 . 

-IT 

Here,  0j,  0^  denote  first  - and  second  order  terms,  respectively, 

1 17 

Tg(cr)  = g(cr)  - — f g(<r)dcr  , 

-IT 

a.„  are  small  parameters  describing  the  small  perturbations  due  to 
M S 

the  Moon  and  Sun,  respectively,  and  X.  is  an  eigenvalue  parameter 
which  is  dependent  upon  the  Earth's  angular  velocity. 

The  structure  of  the  problem  is  as  follows 

(17)  F(x,  \)  = 0,  x = («,  t)  € C^(Kq)  X C^(Kq)  . 

The  existence  of  natural  oscillations  where  a . = a = 0,  i.  e.  in  the 

O 

absence  of  the  Moon  and  Sun  leads  to  a bifurcation  problem  and  may  be 
treated  using  Theorem  1.  What  is  rather  difficult  here  is  to  verify  the 
simplicity  of  the  eigenvalues.  This  is  carried  out  by  showing  that  the 
eigenfunctions  have  the  following  structure 

tf(p,<r)  = b(p)  sin  n<r,  X.  = X.  , n = 1,  2,  . . . 

and  that  (b(p),  X.^)  satisfies  a second-order  differential  equation  that 

is  dependent  on  n . X.  * X for  n * m then  follows  from  nodal  prop- 

n m 

erties. 

The  existence  of  tidal  waves  in  the  presence  of  both  the  Moon  and 
Sun  may  be  proved  by  means  of  the  implicit  function  theorem  because  for 


218 


E.  ZEIDLER 


\ * X 


,-l. 


n>  n = 1,  2,  . . . the  inverse  operator  (0,  \)  exists  in  (17). 


The  solutions  have  the  following  structure 


z 


k,  m,  p 


k m P „ 
a a s ; 

M S kmp 


= z 

k,  m,  p 


k m p 
a ._a0  s t, 

M S kmp 


(p  s o when  perturbing  bodies  are  present  ). 

For  irrotational  waves  (f  2 0),  , t,  may  be  constructed  explicit - 

" kmp'  kmp 

ly  (see  [34]).  In  the  general  case  (f  ^ 0),  •£»,  , r,  are  obtained 

1 ’ kmp’  kmp 

through  the  solution  of  ordinary  differential  equations  by  means  of  an 
iterative  method  (see  [5]). 

The  method  outlined  above  was  used  in  [39]  to  prove  the  existence 
of  sinusoidal  capillary-gravity  waves  of  the  type  referred  to  in  §1  and 
characterized  by  general  vorticity  distributions.  However,  it  should  be 
noted  that  the  method  described  in  [5]  has,  in  a number  of  points,  sev- 
eral improvements  over  that  given  in  [39]. 

Further  information  about  capillary-gravity  waves  in  circular  chan- 
nels is  given  in  [34]. 

§ 4.  Flow  Around  a Gas  Bubble 

We  are  interested  here  in  the  existence  of  a plane  gas  bubble  in  a 
steady-state  flow  of  an  inviscid,  incompressible,  and  homogeneous 
fluid,  which  at  infinity,  may  be  seen  to  be  superimposed  on  a parallel 
and  circulatory  flow.  Also  acting  is  the  force  of  gravity.  What  is  re- 
quired is  the  shape  of  the  gas  bubble.  (Figure  4). 


Figure  4 
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y 


After  a conformal  mapping  one  has  the  following  problem- 
analytic  function  w = 3 + It  in  the  unit  circle  K £ {£  = pel<r, 
with  the  boundary  condition  at  p = 1 


Find  an 
0<  p<l} 


* dT  -8  2 8 

' — = -1  + Pe  + (Z  + C coso-)  e + 
da 


-•8  r -'6 

+ G e J e sin(T(a)  + a)da 
0 

and  the  symmetry  conditions 


8(p,  -a)  = fl(p,a),  T(p,  -a)  = -T(p,a),  f 3(l,a)da  = 0 . 

-IT 

P,  Z,  C,  G are  constants. 

This  problem  may  be  treated  using  the  method  outlined  in  §1  above 
(Theorem  1)  (cf.  [ 37 ][ 34 ] ) . For  previous  studies  of  special  cases  see 
[21],  [3]. 

§ 5.  Open  Problems 

Problems  which,  in  the  author's  opinion,  have  not  yet  been  studied 
include  the  following  ones; 

- Treatment  of  the  problem  in  §4  as  a three-dimensional 
problem; 

- studies  of  exact  three-dimensional  tidal  models; 

- demonstration  of  the  existence  of  solitary  waves  with  the 
surface  tension  taken  into  consideration  (Figure  5,  cf.  [13]) 


<— 

Figure  5 

Figure  6 

Figure  7 
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- demonstration  of  the  existence  or  nonexistence  of  cnoidal  waves 
with  due  consideration  of  the  surface  tension  for  h > hQ  (cf.  (3)) 
(proof  of  nonexistence  of  infinite  depth  solitary  waves  has  been 
produced  by  Krasovski  [18]); 

- demonstration  of  the  existence  of  highest  waves  of  permanent 
type  (Figure  6,  Figure  7;  the  wave  crest  is  not  smooth  and  has 
an  angle  of  magnitude  2tt/3,  cf.  Yamada  [32,  33]; 

- Krasovski  [17]  showed  that  there  is  a permanent  gravity  wave  of 
infinite  depth  for  every  a;  0 < a < tt/6,  with  the  angle  of  the 
tangent  to  Yq(x)  being  equal  to  a in  one  point.  Does  this  re- 
sult also  hold  for  capillary-gravity  waves? 

- It  is  recommended  that  a study  should  be  made  to  investigate  the 

global  structure  of  branches  of  solutions  for  permanent  waves  in 

$ 

the  sense  of  the  basic  alternative  of  Rabinowitz  [2  3]. 

- What  is  the  range  of  values  of  the  parameter  s for  which  the 
solution  expansion  (12)  converges  (cf.  [2 S]) ? 

- In  the  case  of  capillary-gravity  waves  there  are  critical  wave- 
lengths to  which  double  eigenvalues  are  known  to  correspond. 

It  is  recommended  that  a study  should  be  made  to  prove  the  ex- 
istence of  waves  in  this  case  (cf.  [34]). 

> 

In  conclusion,  the  author  wishes  to  thank  the  organizing  committee 
and  its  chairman,  Professor  Paul  H.  Rabinowitz,  for  the  invitation  to  this 
most  interesting  symposium. 
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The  Finite  Amplitude  Dynamics  of 
Baroclinic  Waves 

Joseph  Pedlosky 


1 . Introduction . 

Solar  heating  is  the  ultimate  energy  source  for  the 
motion  of  both  the  atmosphere  and  the  oceans;  lunar  tidal 
oscillations  (relatively  weak)  excepted.  The  radiant  en- 
ergy emitted  by  the  sun  may  vary  slightly  over  very  long 
periods,  but  a sensible  meteorological  idealization  con- 
sists of  holding  the  solar  source  strength  fixed.  The  ro- 
tation of  the  earth  therefore  produces  a diurnal  variation 
of  heating  and  the  orbital  progress  of  the  earth  produces 
seasonal  change.  Nevertheless  the  movement  of  the  atmo- 
sphere as  evidenced  by  the  phenomenon  of  weather  reveals 
the  existence  of  motions  at  spatial  and  temporal  scales 
not  directly  imposed  by  these  astronomically  fixed  varia- 
tions of  solar  heating.  A similar  examination  of  oceanic 
motions  has  shown  the  presence  of  very  energetic  fluctua- 
tions^ whose  horizontal  spatial  and  temporal  scales  (of 
order  300  km  and  weeks  to  months,  respectively)  are  not 
directly  imposed  by  either  solar  heating  or  the  atmospher- 
ic wind  forcing  which  combine  to  produce  the  general 
oceanic  circulation. 

It  is,  however,  easy  (in  principle)  to  imagine  a dy- 
namical state  of  the  atmosphere  and  ocean,  consistent  in 
every  respect  with  basic  physical  principles  in  which  all 
change  is  predictable  with  the  appropriate  astronomical 
period,  in  which  each  season  is  identical  to  its 

^ The  so-called  "meso-scale"  eddies. 
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predecessor,  and  such  that  an  almanac  of  the  past  might 
serve  as  a predictor  of  the  future.  Although  such  a dynam- 
ical state  might  be  consistent  with  physical  principle,  it 
is  not  the  state  which  is  realized  in  nature.  Instead  we 
see  an  ocean  and  atmosphere  with  a rich  spectrum  of  motions 
churning  and  seething,  all  to  the  delight  of  poets  and  the 
near  despair  of  those  responsible  for  prediction. 

It  is  natural  to  take  the  view  that  the  simpler,  peri- 
odically repetitive  state  of  the  atmosphere  and  ocean  is 
hydrodynamically  unstable  with  respect  to  the  realized 
state  and  that  the  observed  fluctuations  are  manifesta- 
tions of  the  instability  of  simpler,  more  serene  states  of 
motion.  It  is  not  the  purpose  of  this  lecture  to  pursue 
the  full  implications  of  this  view  (the  reader  is  referred 
to  the  excellent  monograph  of  Lorenz  for  an  interesting 
discussion) . Instead,  I shall  assume  its  validity  and 
here  discuss  some  questions  related  to  the  theory  of  sta- 
bility of  atmospheric  and  oceanic  currents. 

The  pioneering  studies  of  Charney  (1947)  and  Eady 
(1949)  showed  that  large  scale  atmospheric  (and  oceanic) 
flows  with  realistic  distributions  of  velocity  would  be- 
come unstable  with  respect  to  disturbances  which  release 
the  available  potential  energy  associated  with  the  nearly 
ubiquitous  presence  of  horizontal  density  gradients  in 
such  flows.  The  basic  explanation  for  the  instability  may 
be  easily  described. 

Consider  a fluid  under  the  influence  of  gravity  (g) 
which  possesses  both  vertical  (parallel  to  g)  and  horizon- 
tal density  variations.  For  example, 

(1.1)  p = pQ (y , z)  . 

For  the  moment  suppose  8pQ/3z  and  3p  /3y  are  constant. 

Then  the  lines  of  constant  density  will  be  inclined  to 
the  horizontal  as  shown  in  Figure  1.  Normally  such  a 
state  is  not  in  equilibrium.  Such  inclined  density  sur- 
faces would  tend  to  rotate  into  coincidence  with  horizon- 


THE  FINITE  AMPLITUDE  DYNAMICS 


227 


tal  planes  (equi-potential  gravitational  surfaces)  giving 
rise  to  motion  in  the  y-z  plane.  If,  however,  the  entire 
fluid  is  rotating  about  the  z axis  with  angular  velocity 
ft,  this  density  state  is  a possible  equilibrium  state  in 
the  presence  of  a fluid  flow  in  the  x direction  (into  the 
paper)  which  leaves  the  density  surfaces  undisturbed. 

Such  a rectilinear  flow  in  the  x direction  gives  rise 
to  a Coriolis  acceleration  in  the  positive  y direction, 
which  in  equilibria  is  balanced  by  a pressure  variation 
with  y,  viz: 

(1.2)  p(2ft)u  = - 

dy 

while  in  the  vertical  direction  the  pressure  gradient  is 
balanced  by  buoyancy, 

(1.3)  pg  - - H 


Consistency  of  (1.2)  and 
d.4a)  2ft  | [ 1 


(1.3)  requires  that 
(3z/9y) p 
Oz/3y)  ^ 


or 

(1.4b)  2ft  = + 2.(|£) 

3 z p 3y'  p 


Hence,  when  density  surfaces  and  pressure  surfaces  do  not 
coincide  the  vertically  varying  Cor iol is  force/unit  mass, 

2ft  3u/3z,  can  balance  the  pressure  forces  that  in  a non- 
rotating system  would  give  rise  to  circulations  which  tend 
to  align  the  surfaces  of  density,  pressure  and  geopotential. 
Since  the  velocity  required  to  produce  the  Coriolis  force 
is  perpendicular  to  the  density  gradient,  this  density 
state  is  a dynamic  equilibrium  state.  Such  states  are 
called  baroclinic  and  they  occur  frequently  in  the  atmo- 
sphere, where,  for  example,  the  increase  of  the  westerly 
winds  with  height  balances  the  poleward  increase  of  density 
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and  in  the  ocean  where  the  surface  intensification  of  the 
Gulf  Stream  is  observed  to  occur  in  regions  of  sharp  hori- 
zontal density  variations. 

Consider  now  the  vertical  displacement  of  a fluid  ele- 
ment from  position  A in  Figure  1 to  another  position,  say 
B.  For  simplicity  suppose  that  the  fluid  element  preserves 
its  density  during  the  displacement.  When  the  fluid  ele- 
ment arrives  at  B it  will  have  a density  differing  from  its 
surroundings  by  an  amount 

3 p 3p 

d.5)  6p  = - (n  + c_|) 

where  n and  £ are  the  elemental  displacements  in  the  y and 
z directions,  respectively.  The  Archimedean  buoyancy  force 
in  the  vertical  direction  per  unit  mass  is  - g 6p/p  which 
when  projected  on  the  virtual  trajectory  gives  rise  to  a 
force  in  the  direction  of  motion  equal  to 

(1.6)  -g— sine  = Sin20{l  - ^)p/tan6H 

where  0 is  the  angle  the  trajectory  makes  with  the  horizon- 
tal and  9.  is  the  length  of  the  virtual  displacement.  In 
almost  all  natural  situations  of  interest  the  fluid  is 
gravitationally  stable  to  vertical  displacements,  i.e. 
3p^/3z  < 0,  i.e.  if  the  fluid  element  is  displaced  verti- 
cally to  point  C in  Figure  1,  it  experiences  a restoring 
force  proportional  to  displacement  and  returns  to  its 
equilibrium  position.  However,  (1.6)  reveals  that  if  the 
element  is  displaced  so  that  its  trajectory  satisfies 

(1.7)  0 < tan0  < (3z/3y) 

P 

the  resulting  force  in  the  direction  of  motion  is  positive 
and  the  fluid  element  will  further  accelerate  away  from 
equilibrium.  Referring  to  Figure  1,  it  is  evident  that 
this  requires  fluid  trajectories  to  lie  within  the  wedge 
defined  by  the  horizontal  geopotential  surface  and  the 
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surface  of  constant  density.  Such  a wedge  opening  exists 
only  due  to  the  role  of  rotation  in  the  first  place  and 
consequently  the  resulting  instability  is  peculiar  to  ro- 
tating fluid  systems  like  the  ocean  and  atmosphere.  Since 
the  instability  hinges  on  the  non-coincidence  of  density 
and  geopotential  or  pressure  surfaces,  the  instability  goes 
by  the  name  "baroclinic"  instability.  It  is  also  clear 
from  the  above  discussion  that  the  instability  occurs  when 
light  fluid  rises  or  dense  fluid  sinks  within  the  "wedge 
of  instability".  This  releases  potential  energy  associated 
with  the  horizontal  density  gradient  of  the  basic  current. 
The  instability  is  therefore  a form  of  convection,  and 
highly  slanted,  for  in  the  atmosphere  or  oceans  the  slope 
of  the  density  surfaces  is  quite  small . 

The  truly  major  contribution  of  the  early  investiga- 
tors was  their  demonstration  that  the  instability,  de- 
scribed crudely  above,  manifests  itself  as  a wavelike  dis- 
turbance of  the  pressure,  density  and  velocity  fields  with 
wavelengths,  frequencies  and  spatial  structure  that  in 
many  ways  match  the  observed  "weather  waves"  in  the  at- 
mosphere. The  theory  of  baroclinic  instability  then 
became  a central  concept  in  understanding  the  nature  of  the 
atmospheric  general  circulation  and  recent  and  future 
oceanic  field  experiments  will  determine  whether,  as  sus- 
pected, it  is  of  similar  importance  in  the  ocean. 

An  implicit  philosophical  assumption  of  the  early  in- 
stability theories  was  that  although  the  dynamical  state 
without  the  waves  is  unstable,  the  instability  is  suffi- 
ciently weak  so  that  the  deduced  linear  instabilities  would 
bear  a recognizable  structural  connection  to  the  observed 
fluctuations,  rather  than  leading  to  a catastrophic  transi- 
tion to  a fully  turbulent  state  of  motion.  In  many  ways, 
i.e.  wave-length,  frequency  and  geographical  distribution, 
such  expectations  have  been  realized.  This  fact  leads  nat- 
urally to  the  next  stage  of  theoretical  development.  That 
is,  the  description  of  this  important  instability  phenomena 
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Fig.  2.  Schematic  rendering  of  the  two-layer  model 
model  for  baroclinic  waves. 
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in  finite-amplitude  is  needed  so  that  the  amplitude 
limiting  mechanism,  the  effect  of  the  instability  on  the 
general  circulation,  and  the  interaction  with  other  modes 
of  motion  can  be  described.  The  goal  of  such  description 
is  the  development  of  a conceptual  foundation  which  helps 
understand  the  results  of  geophysical  observation,  numeri- 
cal and  laboratory  experiments  (which  this  lecture  gives 
little  opportunity  to  describe) , and,  in  turn,  helps  us  to 
understand  the  bifurcated  dynamical  state  in  which  we  find 
the  atmosphere  and  ocean. 

2 . The  model. 

Much  of  the  recent  theoretical  and  experimental  work 
on  finite-amplitude  dynamics  of  baroclinic  waves  has  uti- 
lized a physical  model  of  elegant  simplicity.  The  model, 
first  introduced  by  Phillips  (1951)  is  the  two-layer  model 
shown  in  Figure  2.  Two  incompressible,  homogeneous  fluids 
of  slightly  different  densities  lie  horizontally  on  a plane 
rotating  with  angular  velocity  The  vector  is  anti- 

parallel to  gravity  g.  The  fluid  is  contained  in  an  in- 
finitely long  channel  whose  width  is  L.  In  this  model 
the  baroclinicity  (effectively  the  horizontal  density  gra- 
dient) is  limited  to  a narrow  interface  between  the  two 
layers.  The  fluid  is  considered  to  have  a small  enough 
kinematic  viscosity  v,  so  that  the  effects  of  viscosity 
are  limited  to  thin  viscous  boundary  layers  at  the  upper 
and  lower  rigid  lids  and,  in  some  studies  (Hart  1973,  1976) 
also  at  the  interface. 

The  depth  of  each  layer  in  the  absence  of  motion  is  D 
. 2 2 

and  it  is  assupied  that  L /gD  is  sufficiently  small  so 
that  the  centrifugal  curvature  of  the  interface  may  be 
neglected . 

Scales  U,  L,  L/U,  D are  used  to  non-dimensionalize 
the  horizontal  velocity,  horizontal  lengths,  time  and  depth 
respectively.  If  the  parameters  z = U/2QL  (Rossby  number) 
and  E = v/ (ID  (Ekman  number)  are  both  small,  it  can  be  shown 
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that  to  lowest  order  the  motion  is  quasi-geostrophic 
(Phillips  1963) . That  is,  to  order  e and  E the  Coriolis 
force  due  to  the  horizontal  velocity  is  balanced  by  the 
horizontal  pressure  gradient  while  the  vertical  pressure 
gradient  balances  buoyancy  forces.  Under  these  circum- 
stances it  can  be  shown  (Pedlosky  (1970))  that  the 
governing  equations  of  motion  are  the  equations  for  the 
0(1)  vorticity  in  each  layer  viz: 

(2.1)  jjL  (V2*n  + By)  = (-l)n  F ^ (ip2  - r V2^ 


n = 1,2 

where  the  subscripts  1,  2 refer  to  variables  defined  in  the 
upper  and  lower  layers,  respectively.  In  (2.1)  ^n(x,y,t) 
is  the  non-dimensional  dynamic  pressure  field,  which  due  to 
the  quasi-geostrophic  balance  between  Coriolis  and  pressure 
forces,  acts  as  a stream  function  for  the  horizontal  velo- 
city seen  relative  to  the  rotating  frame.  That  is 

3iJj 

(2.2)  u = - dip  /3y,  v = ^ 

n Yn  n 3x 


where  u and  v are  the  velocities  down  the  channel  (x  di- 
n n 

rection)  and  across  the  channel  (y  direction) . The  verti- 
cal component  of  vorticity  is 


(2.3) 


3v 

r = — - 
'n  3x 


3u 


= ( 


T>*n 


3y  3x‘  3y‘ 

It  is  important  to  note  that  within  each  homogeneous  layer, 
the  horizontal  velocities  are  independent  of  height  (1.4a). 
In  deriving  (2.1)  it  has  been  assumed  that  the  vertical 
vorticity  of  the  rotating  frame,  2S^,  is  linearly  increasing 
in  the  y direction.  This  approximation,  first  introduced 
by  Rossby  (1939) , can  be  shown  to  contain  the  essential 
dynamical  effects  of  the  earth's  sphericity  if  y is  asso- 
ciated with  the  northward  directed  co-ordinate.  Thus  (2.1) 
states  that  in  each  layer  the  rate  of  change  of  the  total 
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vorticity  (£  which  is  the  vorticity  relative  to  the  earth 
plus  By,  the  variable  frame  or  planetary  vorticity)  is  due 
to  two  effects.  The  first  term  on  the  right  hand  side  of 
(2.1)  is  the  production  of  vorticity  by  the  stretching  of 
fluid  columns  that  is  produced  by  the  motion  of  the  inter- 
face. Its  effect  is  equal  and  opposite  in  the  two  layers. 
The  second  term  represents  the  destruction  of  vorticity  by 
viscous  dissipation  in  the  viscous  boundary  layers. 

The  non-dimensional  parameters  appearing  in  (2.1)  are 


B = 


l2(  ^ 
L ( dy 


)/U,  sphericity  factor 


F = 4Q2L2/g 


( P2_P1 ) 


stratification  parameter 


r = E /e,  viscosity  measure. 

For  a spherical  earth  df/dy  = 2ftcos0/a  where  0 is  latitude 
and  a,  the  earth's  radius.  B,  F and  r are  constants  in  the 
geostrophic  limit,  and  the  nature  of  the  stability  problem 
and  the  associated  finite  amplitude  dynamics  depend  on 
their  relative  magnitudes.  In  (2.1)  they  are  assumed  to  be 
0 (1 ) . or  smaller . 

The  appropriate  boundary  conditions  for  (2.1)  are 
(Pedlosky  1970) 


(2 . 4a , b) 


1 im 

X+oo 


^n 

3x 


f 32* 

n 

3t3y 

-X 


0 


dx 


0 


y = o,  i 
y = o,  l 


The  fundamental  non-linearity  in  (2.1)  is  provided  by 
the  fact  that  in  each  layer 

,,  d _ 3 . 3 3 3 *^n  3 ^n  3 

(2'5)  art'3t  + un37+vn37=3t“3SrS7+33r3^- 

So  far,  almost  all  the  finite  amplitude  studies  have 
considered  the  instability  and  subsequent  bifurcated  states 
associated  with  the  mean  flow 
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(2.6)  C’  " - V 

where  represents  a y independent  flow  in  the  x direction 
which  is  different  in  each  layer.  The  constant  slope  of 
the  interface  in  the  basic  state  is  proportional  to  - U2 
and  provides  the  available  potential  energy  for  the 
instabil ity . 

The  total  stream  function  is  given  by 

(2.7)  = ~ Uny  + 

from  which  it  follows  that  the  4 satisfy 

n 1 

3<p 

(2.8)  (lt  + Un  fe)(v2<t’n+(-1)n(<,,l-<(>2))  + (B'("1)nF(Ul"U2))3-xJ1 


= - r V2<)>n  - J (4>n , V2<|>n  + (-1) n (4>1-4>2)  ) 


where  the  Jacobian  operator  is  defined 

(2.9) 

for  any  two  functions  (f ,g) . In  addition 


-lf  . . 3f  3g  3f  3g 
J(f'9>  = Six  3y  ~ ^Ty  3x 


(2.10) 


H. 


3x 


= 0, 


1 im 
X-*°° 


x a2*. 


3y3t 


dx  = 0 


-X 


y = 0,1 


With  only  minor  alterations  the  same  model  is  appro- 
priate for  the  study  of  laboratory  flows  (Hart  1972,  1973, 
1974)  in  cylindrical  geometries  and  to  the  investigation  of 
continuous  velocity  profiles  in  the  case  of  constant  shear 
and  zero  sphericity  (Drazin  1970,  1972)  . 

3 . Linear  instability. 


If  the  disturbance  has  a small  amplitude,  at  least 
initially  a 1 inear  stability  analysis  will  be  valid.  If 
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the  last  term  in  (2.8)  is  then  ignored,  wave  solutions  of 
(2.8)  which  satisfy  (2.10)  can  be  found  in  the  form 


(3.1) 


= R 


ik(x-ct) 
e sm 


miry 


The  eigenvalue  c is  given  by  the  relation 

2 

(3.2) 


c = 


(a  +F ) r q 2 /f  i 

* 16  + lr  a /k} 

a (a^+2F) 


+ 1 4F2  ( B+ir  a2/k)2  - (1^  ~u2)  2 ( 4F2-a4  ) a4 } V2a2  (a  2+2F) 


where 

(3.3)  *2/Al 

(3.4) 


2 

a +F 


(6+F(U1-U2) ) 
(UL-c) 


. 2 
la  r 

kF  (l^-c) 


2 ,2  2 2 

a = k + m tt  . 


y and 


Instability  occurs  where  c.,  the  imaginary  part  of  c,  is 
positive . 

The  theoretical  development  of  the  non-linear  problem 
depends  crucially  on  examining  cases  in  which  the  amplitude 
of  the  wave,  although  finite,  remains  small.  This  in  turn 
requires  that  the  analysis  be  limited  to  that  part  of 
parameter  space  where  the  linear  growth  rate  (kc^),  is 
small  compared  to  the  advective  time  scale  kU . An  impor- 
tant mathematical  question  then  arises.  As  the  neutral 
boundary  (i.e.  the  surface  in  parameter  space  on  which  c. 
is  zero)  is  approached,  do  both  roots  (3.2)  have  vanishing 
imaginary  parts  or  only  one?  That  is,  is  the  neutral  curve 
defined  as  a coalescence  of  two  roots  (3.2)?  Consider  the 

case  r = 0 (inviscid  instability).  The  shear  flow  will  be 
2 

unstable  when  a < 2F  and 

(3.5)  (U1-U2)2  > 4F2B2/(4F2-a4)a4  = U2(a2) 


Ui  - 02  - Uc(l  + A) 


For 

(3.6) 
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where  A <<  1 


(3.7) 


ci  = + 6F(2)*5  A**/a  2 (a2  + 2F) 


Hence  in  the  invisc id  limit  the  neutral  curve  is  defined  by 

a coalescence  of  two  roots;  c.  has  a branch  point  at  A = 0 

1 w 

and  the  growth  rate  is  proportional  to  A . On  the  other 

2 

hand,  when  6 = 0,  r ^ 0,  instability  requires  a <2F  and 


(3.8) 


- . 2 2 ..2  _ 

(u,-u2)2  > — = u2 

1 2 ( 2F-a2 ) c 


When  U.-U_  = U (1+A),A<<  1 
1 i c 

either 

U a (2F-a2) 15 

(3.9a)  c.  = — A 

2 (a2+F) 


or 

(3.9b) 


c . 
1 


2 (a2+F) 
(a2+2F) 


r/k. 


The  stability  curve,  U^-l^  = Uc*  is  therefore  a tran- 
sition curve  for  one  root  of  (3.2)  in  this  case  (3.9a), 
while  the  second  root  (3.9b)  remains  0(1)  and  negative. 
Hence,  one  root  (3.9b)  represents  rapid  decay  on  an  0(1) 
time  scale  while  only  the  root  (3.9a)  yields  slow  ampli- 
tude evolution  at  a rate  proportional  to  A.  This  dis- 
tinction between  the  inviscid  case  (coalescing  roots)  and 
the  viscous  case  (distinct  roots)  is  crucial  to  the  non- 
linear theory. 

4 . Finite  amplitude  theory. 


The  finite  amplitude  theory  of  baroclinic  waves  pivots 
about  the  analysis  of  the  dynamics  for  those  supercritica- 
lities A,  which  are  small.  This  restriction  is  required 
so  that  the  amplitude  of  the  wave  at  equilibration  remains 
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sufficiently  small,  though  finite,  that  a perturbation 
theory  appropriate  for  weak  non-linearity  is  valid.  Small 
A also  implies  long  development  times  compared  to  the  ad- 
vective  time  scale.  It  is  useful  to  recognize  this  expli- 
citly by  introducing  a new,  "slow"  time  variable 

(4.1)  T = o(A)t 


where 

(4.2)  o ( A)  = |A|j 


The  exponent  j depends  on  whether  the  marginal  curve  is  a 
transition  point  for  one  or  two  roots  of  the  dispersion 
relation  as  discussed  in  section  (3).  If  the  marginal 
curve  is  a coalescence  point  j = h (3.7),  if  not  j = 1 
(3.9a).  The  absolute  value  is  used  in  (4.2)  to  allow  an 
examination  of  sub-critical  states.  The  disturbance  stream 
function  4>  is  treated  as  a function  of  the  two  time  vari- 
ables t and  T so  that  in  (2.8) 

(4,3)  ft  ft  + a(A)  If 

In  each  of  the  cases  <|>  is  developed  in  the  asympto- 
tic series 


(4.4) 


= A 


(1) 


aU, 


(2) 


3/ 2 , (3) 


where  <|>  ^ is  written  in  the  form 

(4.5)  <f>  ^ ^ = A (T)e^^x  c^sinmiTy 

n n 


where  c(k,m)  is  the  real  phase  speed  on  the  neutral  curve. 

- Uj  = Uc(k,m).  The  substitution  of  (4.3),  (4.4)  into 

(2.8)  leads  to  a sequence  of  linear  problems,  the  first  of 
which  yields  the  marginal  wave,  i.e.  c(k,m)  and  y(k,m). 

The  suppression  of  secular  terms,  which  would  otherwise  in- 
validate (4.4)  for  times  T = 0(1)  in  the  subsequent  higher 
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order  problems  determines  both  the  equation  for  the  ampli- 
tude evolution  and  the  non-linear  correction  to  the  mean 
field.  The  reader  is  referred  to  Pedlosky  (1970)  for  de- 
details  of  the  calculations. 

4 . a . Viscous  equilibration 


When  8 = 0 and  r = 0(1)  so  that  (3.9a)  and  (3.9b)  yield 
the  slightly  supercritical  growth  rates,  the  exponent,  j, 
in  (4.2)  is  1.  The  amplitude  equation  is 

(4.6)  ^ = {kc  . - ik  )A  - k2NA|A|2 

dT  01  | A | 


where 
(4 . 7a  , b) 


r8 (a  +F) 

’ ' ose  solution  is 


2 k 

_A_  M2Fra2)_2 

01  | A | 2 (a2+F) 

{ 4m2n2 (a2-F)  + 3a  2 ( 2F-a2 ) } >0 . 


(4 .3a, b) 


A 


A (T) |elk 


where 

| A (T) I 2 = I A (0 ) I 2 e2kcoiT/(l  + I A ( 0)  | 2 (e2kcoiT-l)) 

|AJ2 

The  amplitude  monotonically  rises  from  its  initial 
value  A(0)  and  monotonically  attains  its  final,  steady 
equilibrated  limit 

<4-9>  I Ax,  1 2 = ( ^ ) 

There  are  several  important  features  of  the  solution. 
First,  as  mentioned  above,  the  amplitude  evolution  is  mono- 
tonic. The  amplitude  rise  is  inexorable  and  the  rise  time 
is  determined  by  kcQ . , (4.7a),  and  this  growth  rate  is 
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identical  to  that  given  by  linear  theory  (3.9a).  The  sole 
new  element  of  the  finite  amplitude  theory  in  (4.6)  is  the 
introduction  of  an  amplitude  dependence  to  the  effective 
growth  rate,  viz:  (kcQ^  “ k N | A | ).  The  Landau  constant, 

N,  is  the  essential  fruit  of  the  non-linear  calculation. 

For  the  baroclinic  problem  N is  positive  which  implies  i) 
only  supercritical  instability  and  ii)  a steady  final  am- 
plitude given  by  (4.9).  A major  conceptual  weakness  of  the 
theory  is  its  inability  in  particular  cases  to  determine 
a' priori  the  sign  of  N.  At  the  present  time  this  requires 
tedious  and  lengthy  calculation. 

Drazin  (1972)  considered  the  finite-amplitude,  viscous 
(r  = 0(1)),  extension  of  Eady's  problem.  In  that  problem 
both  the  density  and  velocity  fields  are  continuous  func- 
tions of  height.  The  shear  is  constant  as  is  the  vertical 
density  gradient.  It  was  already  known  that  the  linear 
theory  appropriate  to  this  problem  gave  results  qualita- 
tively similar  to  the  two-layer  problem  and  Drazin  showed 
that  the  amplitude  equation  for  the  continuous  problem  was 
identical  to  (4.6),  with  only  quantitatively  different  re- 
sults for  cQi  and  N.  The  Landau  constant,  N,  was  also 
positive . 

4.b.  Inviscid  equilibration. 


The  strictly  inviscid  problem  (r  = 0)  possesses  a 
critical  shear  Uc(a),  given  by  (3.5).  In  this  case  the 
stabilizing  dynamical  constraint  that  is  overpowered  at  the 
instability  threshold  is  an  inviscid  vorticity  constraint 
due  to  the  sphericity  (B)  effect.  For  small  supercriti- 
cality  A,  c^  is  given  by  (3.7)  and  the  index  j in  (4.2) 
is  *}.  The  resulting  amplitude  equation,  Pedlosky  (1970)  is 

(4.10)  = k2c2  . A - k 2NA  ( | A | 2 - | A ( 0 ) | 2 ] 

dT  01 


2 2 

2g  FZ A_ 

a4(a2+2F)2  | A | 


where  now 
(4 . 11a) 
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(4.11b) 


(4.11c) 


N 


2 2 -1 
(2m  7i  F) 


’(B+FU  )m2-n2U 
c_ c 

8 (a2+2F)  (Uj^-c)  2 


r 2 2 2 2 2 

a ( 2F-a  j +4in  tt  (a  -F) 

+ (2F-a4)(4tan^^-l2^\2  ) 

L /F/2  2ra  it  +F 

U 2 2 2 

c = u2  + -y  - 3 (a  +F)/a  (a  +2F) 


and  where  A(0)  is  the  initial  value  of  A(T) . 

In  this  problem  as  in  the  viscous  case  the  mechanism 
for  equilibration  is  the  same,  i.e.  the  production  of  a 
correction  of  the  mean  shear  flow,  to  reduce  its  shear  as 
the  wave  grows  and  depletes  the  energy  source  for  the  in- 
stability. An  examination  of  (4.10)  shows,  however,  that 
the  inviscid  amplitude  behavior  is  distinctly  different. 
Instead  of  monotonic  equilibration  to  a steady  state 
(4.10)  yields  an  oscillation.  Equation  (4.10)  can  be 
thought  of  as  an  equation  for  a mass-spring  oscillator 
whose  potential  is  repelling  at  small  amplitudes^  and  re- 
storing (N>0)  for  large  amplitudes. 

In  the  case  when  A is  real  and  (1/A  dA/dT)m  . = c . 

' T=0  oi 

the  maximum  amplitude  during  the  oscillations  is  given  by 


(4.12)  A2ax  = A2  ( 0 ) + ^ (1  + ^-  A2(0))*5 

C0i 


which  occurs  at  an  amplitude  in  excess  of  the  amplitude 
(4.13) 


A2  = A2  (0) + c2./N 


at  which  the  instantaneous  mean  shear  flow  appears  stable . 


t 

In  reality,  for  amplitudes  close  to  the  initial  amplitude. 
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Thus  because  of  the  "inertia"  associated  with  the  inviscid 
dynamics,  the  finite  amplitude  wave  experiences  its  largest 
amplitudes  during  that  part  of  its  cycle  of  perpetual 
growth  and  decay  when  the  instantaneous  mean  flow  would  be 
stable  to  small  disturbances.  The  dnoidal  oscillation  is 
described  by  the  elliptic  function,  viz: 

kA  N*5 

(4.14a)  A (T)  = A dn{  ( — — (T-t)],kJ 

max 

where 

(4.14b)  t = /2[dn_1{  tc } | /kN^A 

/\  max 


(4.14c) 


k = (A 


max 
2 


A2.  )^/A 
min  max 


min 


a2(0) 


oi 

N 


oi 


(1 


- if  a2(0) 

coi 


A crucial  step  in  the  inviscid  problem  occurs  in  the  linear 
2) 

problem  for  <()  ‘ . A solvability  condition  arises  at  that 

order  in  the  form 


(4.15) 


dA 

dT 


3+FU  _ 6-FU 

+ Y £_=■ 

(l^-c)2  (U2~c)  2 


0 


For  A to  indeed  vary  on  the  time  scale,  the  brack- 

et in  (4.15)  must  vanish,  as  it  does  in  the  inviscid  case. 
Indeed  (4.15)  is  reminiscent  of  Rayleigh's  flex  point 
theorem  in  ordinary  shear  flows.  For  r = 0(1),  the  equiv- 
alent bracket  does  not  vanish  which  requires  T to  be  scaled 
for  temporal  variations  on  the  A-1  scale.  In  fact,  (4.15) 
is  merely  a condition  for  the  coalescence  of  the  two  roots 
for  c on  the  linear,  inviscid  marginal  curve. 

4.c.  Slightly  viscous  equilibration. 


It  is  natural  to  investigate  what  the  fate  of  the 
oscillation  described  in  section  (4.b)  would  be  in  the 
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presence  of  a small  amount  of  dissipation.  To  investigate 
this,  it  is  useful  to  recast  the  stability  problem  such 
that  for  a given,  k,  and  r,  the  parameter  F is  con- 

sidered the  critical  one.  In  the  case  6=0,  the  threshold 
value  for  F for  instability  is 


(4.16) 


F = ^ + 
c 2 


, 2 2 
2r  a 


k2(U1-U2) 


F is  then  augmented  by  an  amount  A and  r is  chosen  to  be 
This  last  setting  is  guided  by  the  results  of 


linear  theory  so  that  the  viscous  decay  rate  (r)  will  be 

u 

of  the  same  order  as  the  inviscid  growth  rate  (A  ) . 

It  then  follows 'that  A satisfies  (Pedlosky  (1972)) 


(4.17a) 


d2A 


dT 


2 + f 


: dA 
“*5  dT 


_A_ 

A I 


k2(Ul"U2) 

4a2 


Ak^(U1-U2) 

_ 2 
2a 


m 7T 


3 2<J> 

dy  sin2miry  — = 0 

3y 


(4.17b) 


3 

3T 


3 2<J>  2 

~ - a <1> 

ay 


r 32<fc 
I A | h 3y2 


2 

a miT 
2 (U1-U2) 


11*11 

dT 


2r 


sin2miry . 


In  (4.17a),  T = | A | ^ t , while  the  function  <!>  in  (<s.l7a,b)  is 
the  correction  to  the  x-independent  mean  flow  shear.  When 
the  flow  is  inviscid,  r = 0,  (4.17b)  can  be  integrated 

immediately.  The  resulting  equation  for  A is  identical  in 
form  to  (4.10)  although  the  particular  parameters  have  dif- 
ferent values.  The  inviscid  problem  was  also  investigated 
in  the  case  of  continuous  shear  by  Drazin  (1970),  who  de- 
rived a similar  amplitude  equilibration  equation. 

When  r/|A|^  is  very  large  (i.e.  the  dissipation  time 
is  short  compared  to  the  linear  e-folding  instability  time) 
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(4.17b) 
2 


2 2 

becomes  an  algebraic  equation  linking  3 <I>/3y  with 


|A|“.  On  the  other  hand  (4.17a)  is  approximated  by  the 
singular  perturbation  resulting  from  dropping  the  second 
time  derivative  in  (4.17a)  when  r/ | A | ^ becomes  large.  The 
resulting  first  order  equation,  identical  in  form  with 
(4.6),  matches  asymptotically  the  equation  derived  under 
the  assumption  r = 0(1)  in  the  limit  r -*•  0 . This  coinci- 
dence of  limits  shows  that  the  Landau  equation  (4.6)  is  a 
singular  perturbation  of  the  more  general  problem  (4.17a,b) 
valid  for  large  dissipation  (r  = 0(1)).  As  the  dissipation 
decreases  the  equation  takes  on  more  the  character  of  the 
inviscid  problem  in  section  (4.b). 

Nevertheless,  for  all  r ^ 0,  no  matter  how  small,  a 
steady  solution  exists  for  A,  namely 


(4.18) 


(s)  2 


(VU2> 

a2mV 


The  question  naturally  arises  as  to  the  relationship 
of  the  inviscid  problem  and  its  oscillatory  behavior  and 
the  steady  solution  (4.18),  whose  wave  amplitude  is  in- 
dependent of  r (for  small  r) . 

To  answer  this  question  I (Pedlosky,  (1971),  (1972)) 

relaxed  the  boundary  conditions  on  <J>  from  the  vanishing  of 
3 4> / 3 y on  y = 0,1  to  the  vanishing  of  $ on  those  boundary 
points.  By  both  analogue  computer  calculation  and  analytic 
asymptotic  analysis,  the  following  picture  emerged.  For 
r = 0 the  solution  executes  a simple  closed  trajectory  in 
the  phase  plane  (Figure  3).  For  r/  | A | ^ 0(1),  the  solution 
oscillates  several  times  but  eventually  settles  down  to  the 
steady  solution  (4.18)  (Figure  4).  For  r/ | A | ^ <<  1 however, 
the  solution  seldom  becomes  static.  Instead  the  solution 

continuously  oscillates.  The  steady  wave  solutions  become 

. 2 2 2 
linearly  unstable  for  small  enough  r and  for  k >3m  n . Un- 
steady solutions,  therefore,  are  expected  there.  For 
2 2 2 

k <3m  it  , the  steady  wave  amplitude  solutions  are  linearly 


stable,  but  calculation  shows  that  they  are  unstable  to 


I 

I 


Fig.  5.  The  phase  space  trajectory  (in  perspective)  for 
u 

r/A  <<1.  The  wave  amplitude  perpetually  oscillates  in  a 
limit  cycle. 
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finite  amplitude  perturbations,  which  lead  again  to  un- 
steady oscillatory  behavior.  The  form  of  the  oscillation 
is  identical  with  one  of  the  class  of  inviscid  solutions 
of  (4.17),  but  the  particular  solution  is  chosen  from  the 
infinite  set  of  possible  solutions  by  viscous  integral 
constraints.  The  reader  is  referred  to  Pedlosky  (1971) 
for  the  details.  The  resulting  oscillation  is  a limit 
cycle,  i.e.  the  final,  persistent  oscillation  is  indepen- 
dent of  initial  conditions  in  contrast  with  the  inviscid 
solution  (4.14a).  The  phase  plane  (space)  representation 
is  shown  in  (Figure  5)  . 

Smith  (1974)  criticized  the  alteration  of  the  bound- 
ary condition  in  the  calculation  of  4>.  He  redid  the  prob- 
lem presented  by  (4.17a,b)  in  an  elaborate  calculation 
wherein  <1>  is  expanded  in  a Fourier  cosine  series  to  satis- 
fy the  correct  boundary  condition  on  . The  resulting  in- 
finite set  of  coupled  differential  equations  for  A and  the 
amplitudes  of  the  Fourier  modes  representing  $ were  trun- 
cated and  integrated  numerically.  Smith's  results  (1976) 
reproduce,  essentially,  the  results  of  the  simpler  theory 
outlined  above.  This,  no  doubt,  occurs  because  of  the 

fact  that  the  steady  wave  solutions  have  their  instability 

2 k 

preferred  for  large  a and  small  r/ | A | , for  which  (4.17b) 

becomes  in  the  "interior  approximation",  an  ordinary 
differential  equation  in  T for  <t>  with  y carried  parame- 
tr ical ly . 

Hart  (1973,  1974,  1976)  has  carried  out  an  important 
and  interesting  series  of  laboratory  experiments.  His 
experiment  consisted  of  a rotating  cylinder  containing 
two  immiscible  fluid  layers.  He  redid  the  caculations 
leading  to  (4.17a,b)  taking  into  account  the  effects  of 
the  cylindrical  geometry,  interface  friction  and  the  cen- 
trifugal slope  of  the  interface.  These  alterations  do  not 
affect  the  structure  of  the  amplitude  equation  which  he 
derived  (Hart  1974)  for  small  supercriticalities  in  the 
parameter  F.  Hart  found  that  for  small  supercriticalities 
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and  fairly  small  r (-.05)  a steady  wave  regime  occurred  in 
the  experiment.  The  amplitude  of  the  observed  steady  wave 
was  in  good  agreement  with  the  theoretical  prediction  which 
is  basically  a refinement  of  (4.18)  suitable  for  the  ex- 
perimental situation.  Hart  was  unable  to  reduce  the  ex- 
perimental setting  for  r to  small  enough  values  to  examine 
the  theory  near  the  neutral  curve  (for  such  small  r the 
viscous  boundary  layers  on  the  horizontal  surfaces  become 
turbulent) . Significantly,  if  he  kept  r constant  but  in- 
stead increased  A sufficiently  (A  = 0(1)),  a limit  cycle 
oscillation  qualitatively  similar  to  that  predicted  for 
small  A and  smaller  was  found.  Apparently,  larger  super- 
criticalities, beyond  the  limits  of  validity  of  the  present 
theory  of  (4.17a,b),  qualitatively  reproduce  the  limit  r-*-0. 
Hart  (1976)  has  also  produced  a challenging  diagnostic  nu- 
merical model  in  which  certain  non-linear  interactions  are 
arbitrarily  suppressed  but  which  nevertheless  reproduces 
substantially  his  experimental  observations. 

5 . F i nite  Ampl it ude  Dynamics  as  a function  of 

super  critical ity . 


One  of  the  features  of  the  finite  amplitude  dynamics 
that  is  usually  inaccessible  to  weakly  non-linear  theory  is 
the  alteration  of  the  amplitude  dynamics  as  the  supercriti- 
cality increases  to  such  a degree  that  higher  thresholds  of 
instability  are  crossed.  These  linear  thresholds  are 
usually  inaccessible  since  they  generally  correspond  to 
order  one  supercriticalities  above  the  lowest  threshold. 

In  an  effort  to  examine  this  question,  I studied  the  case 
8=0,  r = 0(1)  (for  which  (4.6)  is  relevant  for  small 
supercriticalities)  when  F>>1  (pedlosky  (1976)).  The 
virtue  of  this  parameter  setting  is  that  for  the  k corre- 
sponding to  the  minimum  critical  shear,  the  stability 


threshold  for  the  m 
(5.1)  A = 


th 


cross  stream  mode  is  given  by 

--iii/r1. 


(UrU2-Uc»  . (m 
- Am  - 


U, 


2/2 
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Therefore  for  large  F,  such  that  FA*5  = 0(1),  small 
supercriticalities  will  penetrate  several  modal  thresholds 
for  instability.  This  is  significant  as  a device  for 
examining  higher  supercriticality  dynamics  within  the  sys- 
tematic framework  of  the  asymptotic  theory.  The  method  of 
analysis  is  similar  to  that  given  in  Section  4 with  one 
crucial  difference.  The  modal  structure  in  y is  not  de- 
termined av priori  by  the  linear  marginal  stability  problem. 
Rather,  the  modal  structure  is  itself  linked  to  the  non- 
linear dynamics. 

With 

(5.2)  - A(y,T)eikx 


it  can  be  shown 
equation  (t  = AF 

(5. 3a, b) 


that  A satisfies  the  partial  differential 
^ r E/2) 


3A 

~ 2. 

3 A 

2 

. 2 

s A ( 1 +u ) 

8y 

3u 

3 2u  . 

32  . A | 2 

3 T 

3y2 

2 1 A 1 
ay 

where  u(y,t)  is  the  correction  to  the  mean  zonal  shear. 
Both  A and  u vanish  on  y = 0,1.  Linear  instability  is 
assured  whenever  the  supercriticality  s,  related  to  A by 


(5.4) 


2 

s 


(AF*5  - — — j-)  ; 
4F  \ 


0(1)  ; 


exceeds  imr.  For  small  supercriticality, 


(5.5) 


S = TT  + 10,  CO  < < 7T 


only  the  first  stability  threshold  is  crossed.  In  this 
case  it  can  be  shown  (Pedlosky  (1976))  A (y , t ) - Aq ( t ) sinuy 
and  the  equation  for  Aq  is  given  by  the  Landau  equation 


(5.6) 


dA 

c 

dr 


n 3 7T  „ . | 2 

= 10A  - —t—  A A 

o 4 0 0 


as  in  (4.6). 


The  problem  of  higher  supercriticality 
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Fig.  6.  Bifurcation  diagram  for  steady  wave  solutions 
r = 0(1),  (U-Uc)Uc  F*3  - 0(1),  as  a function  of  shear 
supercriticality  s. 
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corresponds  to  s = 0(1)  so  that  several  thresholds  are 
crossed . 

The  bifurcated  steady  solutions  of  (5.3a,b)  for  s = 
0(1)  are  given  by  the  non-linear  modes. 


(5.7) 


■My)  = AQsn(2jK(v)y,v) 

j = 1,2,3... 


where 

(5.8a,b)  Aq  = v//v2+l  2X^+1  K(v)  = s/j 


Hence,  from  (5.8b)  the  modulus  v of  the  elliptic  func- 
tion describing  the  jt^'  mode  is  a function  of  the  super- 
criticality s as  is  the  amplitude  Aq  by  (5.8a).  The  analy- 
tic bifurcation  diagram  is  shown  in  Figure  6.  It  is  impor- 
tant to  note  that  only  a small  neighborhood  of  the  first 
bifurcation  point  is  accessible  to  the  traditional  finite 
amplitude  theory.  Furthermore,  the  modal  shape  is  a func- 
tion of  s.  Rather  than  being  dominated  by  a single  Fourier 
harmonic  each  non-linear  mode  contains  a mixture  of  many 
Fourier  modes.  For  any  s>2ti  more  than  one  steady  mode  is 
possible,  but  of  course  not  simultaneously,  since  the 
superposition  principle  for  non-linear  modes  is  not  valid 
(perhaps  regrettably) . Numerical  methods  are  required  to 
integrate  the  initial  value  problem.  Figure  7 shows  the 
result  of  such  a calculation.  The  function  ^ would  be 
the  first  Fourier  component  of  the  first  non-linear  steady 
mode  while  <t>2  would  be  the  first  Fourier  component  of  the 
second  non-linear  mode.  In  general  for  the  numerical  cal- 
culation the  truncated  representation 


(5.9) 


( A 1 

N 

k£i 

VT)' 

I u 

uk(T). 

sinkny 


was  used  to  integrate  (5.3a,b).  What  Figure  7 shows  is 
that  after  a non-mo no tonic  oscillatory  period  the  solution 
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asymptotes  to  the  first  non-linear  steady  mode,  even  though 
the  initial  spectrum  ($  cc^)  lies  close  to  the  second  non- 
linear mode.  The  results  of  several  calculations  demon- 
strated the  model's  preference  for  that  non-linear  mode 
(j=l)  that  has  the  largest  steady  state  amplitude.  It  is 
furthermore  significant  that  although  the  approach  to  the 
steady  amplitude  is  non-oscillatory  for  small  supercriti- 
cality, it  becomes  increasingly  oscillatory  for  larger 
supercriticality. 

6 . Other  problems. 

The  work  described  in  the  above  sections  all  relates 
to  the  evolution  of  a single  non-linear  wave.  The  problem 
that  arises  when  more  arbitrary  initial  disturbance  spectra 
are  considered  is  still  largely  untouched.  In  the  case 
r = 0(1)  the  argument  can  plausibly  be  made  that  other 
modes  in  the  initial  spectrum  aside  from  the  unstable  modes 
will  quickly  damp.  This  to  some  extent  at  least  rational- 
izes the  restricted  initial  condition  usually  considered. 
The  situation  is  far  from  clear  for  the  inviscid  problem 
where  stable  modes  are  merely  neutral  rather  than  decaying. 

An  interesting  start  on  this  problem  was  made  by 

Loesch  (1974).  He  considered  the  problem  with  r=0 , 0^0  for 

2 

which  a minimum  critical  shear  occurs  at  a = /2F,  U^-l^  = 
0/F.  He  considered  an  initial  spectrum  composed  of  three 
waves  when  the  mean  flow  shear  slightly  exceeds  its  minimum 
critical  value.  The  three  waves  formed  a resonant  tr iad 
wherein  one  wave  was  the  slightly  unstable  wave  while  the 
other  two  were  waves  which  are  linearly  stable  on  the  mean 
flow  but  resonantly  interact  with  the  unstable  wave. 

If  (k^,m^,  u>^ ; (T)  ) are  the  x wave  number,  the  y 

modal  number,  the  (real)  frequency  and  the  amplitude  res- 
pectively of  the  unstable  wave,  while  (k  , m.,  w . , A.), 

i l 1 1 

i=2,3,  are  those  for  the  neutral  waves,  then  to  form  the 
resonant  triad 


THE  FINITE  AMPLITUDE  DYNAMICS 


255 


(6.1) 


+ k2  + 1<3  = 0 


W1  - w2  — 


in  2 = 0 


ml  i m2  + = 0 

where,  recoil,  each  wave  is  of  the  form  A_.e^^jx  u j inm^  Try. 
Loesch  showed  that  the  amplitudes  satisfied  the  equations 


( 6 . 2a , b ,c) 


a dA.  ^ ^ 

dT  ( dT  lNlA2  A3} 


= k2  C2  A1-  k2  NA1(|A1|2  - | A ( 0 ) | 2 ) 


k2NN. 

* -V-»i  ( 


N, 


I A ( 0)  I 2 !A,|2-|A  (0) |2 

— t + — £ } 

N 


dA2  * * 
dT“  = lN2A3Al 


dA3  * * 

3t~~  = iN3AlA2 

where  N,  N3,  are  numbers  calculated  from  the  usual  sup- 
pression of  secularity  condition.  The  problem  posed  by 
(6.2a)  combines  elements  of  the  finite  amplitude  theory  and 
resonant  interaction  theory  and  allows  the  efficient  trans- 
fer of  energy  from  an  unstable  wave  to  waves  that  are 
stable  on  the  mean  flow  by  linear  theory.  Loesch  found  nu- 
merically for  F>10.5,  regardless  of  the  initial  amplitude 
ratios  in  the  triad,  that  the  "neutral"  waves  reached  an 
amplitude  as  large  as  the  "unstable"  wave  by  the  non-linear 
transfer  of  energy  between  triad  members. 

Pedlosky  (1975)  examined  a similar  case  for  very  large 
F where  the  only  equilibration  mechanism  for  the  primary 
unstable  wave  was  its  resonant  interaction  with  neutral 
waves  and  suggested  this  mechanism  as  a possible  explanation 
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for  meso-scale  oceanic  eddies. 

This  review  would  certainly  be  incomplete  without  men- 
tion of  the  pioneering  work  of  Lorenz  (1963) . The  work  is 
extensive  and  a complete  review  is  beyond  the  scope  of 
this  paper.  Lorenz,  fundamentally,  considered  a truncated 
Fourier  representation  of  each  dependent  variable  in  a dy- 
namical model  similar  to  (2.1)  but  one  which  allows,  in  a 
basically  ad  hoc  (but  energy  preserving)  way  for  time 
variations  in  F (the  static  stability) . The  truncation 
level  for  each  variable  is  drastic  and  it  is  surprising, 
in  a sense,  how  effective  such  severe  truncations  can 
still  be  in  reproducing  qualitatively  the  vacillation, 
steady  wave  and  no-wave  regimes  found  experimentally. 

Since  the  truncation  is  not  related  to  any  formal  para- 
metric ordering  it  is  difficult  to  evaluate  the  theoreti- 
cal to  a" priori  validity  of  the  theory.  As  in  Hart's  nu- 
merical work,  the  striking  qualitative  success  of  Lorenz's 
model  serves  as  a challenge  to  more  formal  theories  to 
achieve  as  much. 
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Bifurcation  and  Stability  Problems  in 
Astrophysics 

Norman  R.  Lebovitz 


1 . Introduction 

Astrophysics  is  a rich  source  of  stability  problems, 
and  this  is  especially  true  of  that  branch  of  the  subject 
dealing  with  stellar  structure  and  evolution.  This  is  not 
surprising,  since  a star  is  a fluid  mass,  and  therefore 
subject  to  the  classical  hydrodynamical  instabilities. 
Moreover,  the  mechanical  and  thermal  balances  peculiar  to 
stars  create  further  instabilities  in  certain  parameter 
ranges,  and  these  can  interact  with  hydrodynamical  effects. 
The  net  result  is  a somewhat  bewildering  variety  of  insta- 
bility mechanisms.  Good  reviews  of  the  theory  of  stellar 
stability  are  available  [18,  19],  however. 

Two  kinds  of  stellar  instability  problems  are  discussed 
in  this  article.  They  were  selected  because  the  corres- 
ponding bifurcation  problems  have  particularly  important 
implications  for  astrophysics.  Despite  their  particular 
importance,  their  choice  for  discussion  here  is  inevitably 
somewhat  personal,  and  there  are  many  other  important  bi- 
furcation problems  in  the  theory  of  stellar  structure  and 
evolution,  not  to  mention  in  other  areas  of  astrophysics 
[10]  . 

The  first  problem  discussed  is  intimately  linked  with 
the  general  problem  of  turbulence  in  fluid  dynamics,  and 
may  therefore  be  extremely  formidable.  Nevertheless,  it 
seems  worthwhile  to  state  it  clearly  as  a bifurcation  pro- 
blem because  (a)  it  arises  naturally  in  this  way,  yet  (b) 
does  not  seem  to  have  been  expressed  as  such  in  the  astro- 
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physical  literature.  The  second  problem  (or  class  of  pro- 
blems) addresses  the  following  fundamental  question  in 
astronomy:  what  happens  to  a self-gravitating  mass  endowed 

with  angular  momentum  when  it  undergoes  (as  it  must)  an 
enormous  contraction? 


2 . The  Equations  of  Stellar  Structure 

A star  is  a gaseous  mass  and  is  therefore  described  by 
the  equations  of  fluid  dynamics.  The  form  that  these  equa- 
tions take  must  reflect  certain  characteristic  physical 
assumptions  peculiar  to  stars:  (1)  The  gravitational  field 
acting  on  the  star  is  due  to  matter  in  the  star,  (2)  Vis- 
cous effects  are  normally  assumed  neglibile,  (3)  Energy  is 
dissipated  through  radiation,  and  replenished  by  nuclear 
reactions,  or  by  gravitational  contraction.  The  equations 
of  conservation  of  momentum,  mass,  and  energy  may  be  written 

Dv  1 

(1)  Dt=~p  grad  p + grad  V' 

(2)  5t  ~ -p  div  -X' 

and 

(3)  T§f  = “£■  div  q + e. 


where 


D 

Dt 


(4) 


3 

3t 


+ v-grad 


V (x,t) 


and 


p (x‘ ,t) 


x-x 


dx'  . 


In  these  equations,  v is  the  fluid  velocity,  p the  density, 
p the  pressure,  \)  the  gravitational  potential  (expressed 
by  the  equation  (4) , where  G is  the  gravitational  constant 
and  R is  the  region  occupied  by  the  fluid) , s the  entropy 
per  unit  mass,  q the  radiative  flux  vector,  and  e the 

/V/ 

rate  of  nuclear  energy  production.  For  most  purposes  it  is 
sufficient  to  write  [33J 


(5) 


q = -Q  grad  T, 
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for  a certain  function  Q(p,T),  so  the  radiative  transport 
of  heat  has  the  same  local  properties  as  material  conducti- 
vity . 

The  expression  of  Q as  a function  of  (say)  p and  T 
is  only  one  of  several  constitutive  relations  needed  to  make 
equations  1-3  determinate.  One  also  needs  e expressed 
in  terms  of  p and  T,  and  an  equation  of  state  similarly 
expressing  p and  s.  All  these  quantities  depend  as  well 
on  the  chemical  composition  of  the  matter.  We  shall  assume 
for  the  present  that  the  latter  is  constant.  The  correct 
expression  of  these  constitutive  relations,  expecially  those 
for  Q and  e,  is  a complicated  physical  problem  which  we 
shall  assume  solved. 

What  further  conditions  are  needed  in  specifying  a solu- 
tion of  equations  <l)-(3)?  Initial  conditions  on  v,  p, 
and  s (or  T)  are  needed,  of  course,  but  these  are  not 
enough.  The  specification  of  a star  is  normally  supposed 
to  require  giving  its  mass  M and  chemical  composition. 

The  latter  having  been  assumed  fixed,  we  still  have  for  the 
former  the  integral  constraint 

(6)  / p (x,t) dx  = M. 

JR 

A second  integral  constraint  that  may  be  imposed  without 
loss  of  generality  is  that  the  center  of  mass  remain  at  the 
origin  of  coordinates: 

(7)  J p (x,  t)  x.dx  =0  (k  = 1,2,3). 

R k ~ 

Finally,  there  are  boundary  conditions  to  be  satisfied  on 
the  surface  S bounding  R (the  latter  defined  as  the  set 
of  points  where  p > 0) . Let  S have  the  equation 

F(^,t)  = 0 (defines  S) . 


Then  we  may  take  as  boundary  conditions: 
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(8) 


on  S . 


Whether  the  initial-boundary  value  problem  expressed  by 
equations  ( 1 > — ( 8 ) is  mathematically  well-set  is  not  known, 
at  least  in  the  generality  with  which  the  problem  has  been 
stated.  It  is  a free-surface  problem,  i.e.,  the  function 
F defining  S is  not  given  a priori  but  is  to  be  discov- 
ered; this  only  increases  the  complexity  of  an  already  com- 
plex problem.  Other  difficulties  of  the  formulation  given 
above  could  be  pointed  out,  but  let  us  pass  instead  to  the 
simpler  problems  that  have  in  fact  been  successfully  treat- 
ed by  astrophysicists:  the  construction  of  spherically 
symmetric  equilibrium  configurations. 

Most  of  the  effort  in  the  theory  of  stellar  structure 
has  been  directed  toward  constructing  models  for  which 
v = 0 and  p,  p,  etc.  depend  only  on  r,  the  distance  from 
the  origin.  If  we  define  the  luminosity  Mr)  by  the 
formula 


(9)  l (r)  = 4irr2qr  (r)  , 

then  we  find  in  place  of  equations  (1) , (3) , and  (5) 


(10) 

P'  (r) 

(11) 

M (r) 

and 

(12) 

T'  (r) 

with  boundary  conditions  l 
T = 0 at  r = R.  Here  R 
to  be  discovered  along  with 


= P (r)  (/'  (r)  , 

= 4frr2p(r)e, 

= - [4TTr2Q]  (r)  , 

=0  at  r = 0 , p = 0 and 
is  not  given  a priori,  but  is 
p,  H , and  T.  In  place  of 


equation  (4)  one  has,  in  virtue  of  Newton's  theorem  on 
spherical  distributions  of  mass, 


(13) 


V'  (r) 


2 

s p (s) ds  , 
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which  can  be  substituted  for  l/'(r)  in  equation  (10).  A 
more  convenient  system  of  equations  is  obtained  by  replacing 
the  independent  variable  r by  m,  where 

r 

(14)  m = 4 tt  J s p(s)ds. 

o 

Then  one  finds  the  system  of  four  equations 


(15) 

dr  _ 
dm 

[ 4irr2p  ] 

-1 

t 

(16) 

= 

dm 

-[47rr4]' 

_1Gm, 

(17) 

d£  _ 
dm 

e , 

(18) 

dT 

dm 

- [ ( 47rr2 ) 

2 -1 
i PQ] 

With 

p , e , and  Q regarded  as 

functions 

these  form  a nonlinear  system  of  four  ordinary  differential 
equations  for  (r,p,£,T),  with  the  two-point  boundary  con- 
ditions 

(19)  r = 0,  i.  = 0 at  m = 0 ; p = 0 , T = 0 at  m = M. 

Hidden  in  these  equations  is  a dependence  on  chemical 
composition.  For  given  mass  and  chemical  composition,  one 
finds  (in  practice,  numerically)  a solution  of  the  problem 
(15)— (19) . To  construct  an  evolutionary  sequence,  we  now 
recognize  that  the  chemical  composition  must  change  in 
virtue  of  the  nuclear  reactions.  Since  it  changes  on  a 
timescale  longer  than  other  natural  timescales  appearing  in 
the  problem,  it  is  reasonable  to  suppose  that  the  equili- 
brium equations  continue  to  apply  (cf . Sec.  8) . The  pro- 
cedure used  by  astrophysicists  to  calculate  the  "next" 
equilibrium  model  in  an  evolutionary  sequence  is  to  use  the 
present  model  to  estimate  (according  to  known  rules)  the 
change  in  chemical  composition  during  an  appropriate  time 
interval,  and  then  use  the  altered  chemical  composition 
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in  equations  (15) -(19). 

The  physical  considerations  of  this  section  may  be 
found  in  fuller  form  in  a number  of  books,  e.g.,  references 
[3] , [6] , and  [33] . 

3.  Convective  Instability  and  Convective  Equilibrium 

As  one  follows  an  evolutionary  family  of  equilibrium 
configurations,  one  may  find  that,  at  some  point,  the  equi- 
librium changes  from  stable  to  unstable.  One  may  antici- 
pate that  nonuniqueness  of  the  equilibrium  solutions  occurs 
at  such  a point.  This  possibility  is  now  fairly  widely 
recognized  among  astrophysicists  in  virtue  of  the  actual 
numerical  construction  of  multiple  solutions  in  some  cases 
[27] . This  contrasts  with  a long-held  view  that  equations 
(15) -(19)  must  have  a unique  solution  (The  Vogt-Russell 
Theorem;  see,  e.g.,  [3]). 

Even  at  points  where  equations  (15) -(19)  have  a unique 
solution,  the  more  general  equations  ( 1 ) — ( 8 ) may  allow  so- 
lutions beyond  the  spherically  symmetric,  equilibrium  solu- 
tions. If  one  linearizes  the  equations  (15)-(18),  the  per- 
turbations of  the  known  solution  are  required  also  to  have 
spherical  symmetry.  Consequently,  if  one  wants  to  test  the 
stability  of  a solution  of  (15) -(19)  to  non- spherical  dis- 
turbances, it  is  necessary  to  return  to  the  more  general 
equations  (1)— (8) , and  linearize  these  about  the  state  of 
spherical  equilibrium.  In  order  to  study  stability  against 
convection,  the  purpose  of  this  section,  this  more  general 
procedure  i's  essential,  because  the  convective  motions  are 
not  radially  symmetric. 

In  linearizing  the  equations  ( 1 ) — ( 8 ) for  the  purpose  of 
determining  the  onset  of  convective  instability,  the  assump- 
tion is  usually  made  that  equation  (3)  can  be  replaced  by 


(20) 


Ds 

Dt 


0. 
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This  says  that  the  motion  is  adiabatic,  i.e.,  that  a parcel 
of  fluid,  as  it  moves  along,  does  not  exchange  heat  with  its 
surroundings.  The  physical  basis  for  this  argument  is  that 
the  convective  motions,  stable  or  unstable,  are  on  the  very 
short  timescale  T^  ("dynamical"  timescale) , 

(21)  T:  = (Gp)"1/2, 


where  p is  the  mean  density;  whereas  the  righthand  side 
of  equation  (3)  only  changes  the  entropy  very  slowly,  on 
the  so-called  contraction  timescale  T^ : 


(22) 


where  L is  the  luminosity.  Hence  the  change  in  entropy 
should  be  negligible  during  the  convective  Jnotions. 

Accepting  this  reasoning,  one  finds  that  the  stability 
problem  thus  obtained  leads  to  a very  simple  criterion, 
called  the  Schwarzschild  criterion  in  the  astrophysical 
literature  [33].  Let 

(23)  A(r)  = P'  (r)  - M FT-  p'  (r) 


where  y is  the  ratio  of  specific  heats.  Then  the  star  is 
stable  against  convection  if  A > 0 for  0 £ r £ R,  un- 
stable if  A < 0 in  any  r-interval.  An  alternative  formu- 
lation in  terms  of  the  temperature  gradient  is  more  famil- 
iar in  meteorology.  Let  the  "adiabatic  gradient"  T^d  be 
defined  as  follows: 

(24)  T^d  = T (r ) p ' (r)  . 

Then  one  concludes  stability  if  | T * | < |T^d|,  0 £ r £ R, 
and  instability  otherwise. 

The  spectrum  of  eigenvalues  associated  with  this  sta- 
bility problem  is  peculiar,  and  must  affect  the  character 
of  the  solution  (if  there  is  one)  bifurcating  at  the  onset 
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of  instability.  Typically  [7,  28,  13]  the  spectrum  divides 
into  two  classes:  (1)  that  for  which  the  eigenvalues  and 
eigenvectors  of  the  linear  stability  problem  represent 
sound  waves,  (2)  that  for  which  they  represent  the  much 
slower,  convective  motions.  The  modes  of  class  (1)  are 
safely  stable.  Those  of  class  (2)  are  all  stable  if  A > 0 
everywhere,  but  infinitely  many  of  them  become  unstable 
when  A < 0 somewhere.  Hence  if  the  chemical  composition 
of  the  star  changes  slowly  and  in  such  a way  that  A 
changes  from  positive  to  negative  in  some  r-interval  (no 
matter  how  small  this  interval  or  how  small  | A | is  in 
it) , infinitely  many  eigenvalues  of  the  linear  problem  pass 
from  stable  to  unstable  type. 

The  principal  gap  in  the  theory  of  stellar  structure 
and  evolution  appears  at  this  point.  What  is  the  charac- 
ter of  the  steady-state  solution  (if  there  is  one)  past 
the  point  where  the  static,  spherically  symmetric  solutions 
become  unstable?  Even  if  there  is  a solution,  it  is  likely 
to  be  so  complicated  that  calculating  it  is  a practical 
impossibility.  Astrophysicists  would  be  happy  to  have  an 
answer  to  the  following  more  limited  question.  How  much 
heat  is  transported  by  the  convection? 

The  circumstance  that  infinitely  many  eigenvalues  of 
the  linear  problem  change  from  stable  to  unstable  type 
when  A changes  sign  would  mean  that  the  familiar  exist- 
ence theorems  of  bifurcation  theory  fail  to  apply,  i_f  the 
stability  problem  corresponded  to  the  steady-state  equa- 
tions in  the  usual  way.  But  it  does  not,  because  we  have 
replaced  equation  (3)  with  equation  (20).  What  would 
happen  if  the  stability  theory  based  on  equation  (3)  were 
carried  out?  This  is  not  known.  Even  if  it  made  things 
better  from  the  viewpoint  of  existence  theory,  it  might  not 
help  from  the  viewpoint  of  finding  the  fluid  motions  or 
estimating  the  heat  transport,  since  many,  many  of  the 
linear  problem's  eigenfunctions  would  probably  be  impli- 
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cated  in  any  realistic  description  of  the  flow  or  the  heat 
transport. 

Let  us  conclude  this  section  by  outlining  the  procedure 
actually  followed  by  astrophysicists.  Suppose  a sequence 
of  static  models  is  constructed  along  the  lines  described 
in  the  preceding  section,  and  suppose  that,  at  some  point 
in  this  sequence,  A becomes  negative  in  some  layer 
r^  < r < r^,  so  that  convection  sets  in.  Crude  estimates 
valid  for  many  stars  show  that  convection  is  far  more  effi- 
cient than  radiation  in  transporting  heat.  To  maintain 
the  star's  luminosity  would  require,  in  a convective  region, 
a temperature  gradient  so  minutely  in  excess  of  that 

the  difference  would  be  negligible.  One  therefore  arrives 
at  the  following  procedure.  Continue  to  assume  that  the 
star  is  static  and  spherically  symmetric,  but  replace  equa- 
tion (18)  by 

- [ ( 4tt  r ^ ) 2pQ]  if,  as  a result,  A > 0 

(y-1) T dp  otherwise  , 

p dm 

Hence  one  constructs  two  families  of  equilibrium  configura- 
tions, one  in  radiative  equilibrium,  one  in  convective  equi- 
librium. The  latter  family  is  stable  past  the  point  where 
the  former  is  not. 

The  prescription  (25)  is  adequate  for  many  stars,  but 
fails  in  some  cases,  where  the  balance  between  radiative 
and  convective  transport  is  more  delicate.  In  these  cases 
one  tries  to  estimate  the  heat  transport  via  the  so-called 
mixing-length  theory.  These  estimates  are  rather  crude. 
Substantial  efforts  have  been  made  toward  improving  these 
estimates,  but  they  have  not  yet  succeeded  in  providing  a 
reliable  means  for  doing  so. 

4 . Rapid  Rotation:  Riemann's  Idealized  Models 

Rotation  is  a common  feature  of  the  universe:  stars 
rotate  more  or  less  rapidly  (the  sun  less  rapidly) , binary 


(25)  s- 


268 


NORMAN  R.  LEBOVITZ 


1 

stars  rotating  about  a common  center  of  mass  make  up  about 

half  the  stars  in  the  sky,  galaxies  tend  to  be  highly  flat-  | 

tened  by  rotation.  Accordingly,  it  is  of  interest  to  study  i 

the  evolution  and  dynamics  of  rotating,  self-gravitating  ^ 

masses.  There  are  numerous  obstacles  in  the  path  of  such  a 
study,  however.  For  example,  the  gravitational  force  is 
much  more  difficult  to  treat  when  the  figure  is  distorted 

from  the  spherical  by  rotation  or  other  fluid  motions , and  I 

the  free-surface  aspect  of  the  boundary  conditions  becomes 
a severe  hindrance  as  well.  These  difficulties  are  manage- 
able when  the  rotation  is  small  enough  that  the  distortion 

from  the  spherical  is  small  (cf.  [2]) , but  for  the  case  ^ 

when  the  rotation  is  sufficiently  large  to  have  a decisive  I 

J 

influence  on  the  dynamics  or  evolution,  the  distortion  is 
likely  to  be  large. 

Various  approaches  to  the  problem  of  finding  steady- 
state  figures  having  large  amounts  of  angular  momentum  have 
been  used.  Numerical  models  have  been  constructed  [26]  and 
existence  theorems  have  been  proved  [1,  22].  These  ap- 
proaches have  not  yielded  enough  information  to  enable  one 
to  make  far-reaching  inferences  concerning  the  effect  of 
angular  momentum  on  the  evolution  of  a star.  There  remain 
two  approaches:  Physical  reasoning  and  idealized  models. 

We  shall  concentrate  on  the  latter,  though  the  former  is 
inevitably  involved  in  interpreting  the  results.  The  ideal- 
ized models  are  figures  of  uniform  density  whose  bounding 
surfaces  are  ellipsoids.  The  restriction  to  uniform  density 
makes  these  figures  poor  candidates  for  stars,  which  are 
centrally  condensed,  but  one  can  hope  that  their  behavior 
is  qualitatively  similar  to  that  of  more  realistic  models. 

Seeking  solutions  bounded  by  an  ellipsoidal  surface 
enormously  reduces  the  free-surface  aspect  of  the  problem. 

If  the  semiaxes  are  a^a^a^,  one  must  determine  these 
three  parameters,  rather  than  the  infinitely  many  needed 
to  characterize  a general  surface.  The  further  assumption 
that  the  density  p is  independent  of  nosition  gives  for 
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the  gravitational  potential  the  well-known  [4]  expression 

, t 2 

(26)  l/(x)  = TrGp  {I  - Z A,  x,  ^ } , Z ~ < 1, 

k=l  K K k=la  1 
k 

where  the  axes  of  x1,x2,x3  are  chosen  along  the  principal 
axes  of  the  ellipsoid.  The  coefficients  I and 
A^(k  = 1,2,3)  are  known  functions  of  the  semiaxes 
a^(k  = 1,2,3).  In  1742,  Maclaurin  showed  that  an  oblate 
spheroid  (a2  = a-^a.^  £ a^)  could  be  a figure  of  relative 
equilibrium  in  a frame  of  reference  rotating  with  angular 
velocity  u about  the  x^-axis  provided 

2 

2 a 3 a 3 

(27)  w = 2rGp  (A. A ) , 0 < — < 1. 

1 a^  J al 


These  Maclaurin  spheroids  were  thought  to  be  the  only  fig- 
ures of  equilibrium  of  ellipsoidal  shape  until  1834,  when 
Jacobi  discovered  that  ellipsoids  with  three  unequal  axes 
are  possible  if 


(28) 


a 3A3 


2 2 
ala2 
2 2 
ara2 


(A2_A1} ' 


and 


(29) 


w = 2ttGp 


2 2 
alAl_a2A2 
2 2 
al"a2 


1, 


Equation  (28)  gives  a^/a^  as  a function  of  a^a^,  hence 
specifying  the  shape  of  the  ellipsoid.  In  the  limit  as 
a2  al ' one  a3^al  = 0.5827*-..  For  this  value  of 

a^/a^,  there  also  exists  a Maclaurin  spheroid.  One  can 
show  that  the  limiting  value  of  the  angular  velocity  (29) 
agrees  with  that  given  by  equation  (27) . Hence  the  limit- 
ing Jacobi  figure  is  a Maclaurin  spheroid:  The  Jacobi 
ellipsoids  bifurcate  from  the  Maclaurin  spheroids  at  this 
point. 
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In  a paper  published  in  1861,  Riemann  substantially 
generalized  these  results,  still  in  the  context  of  ellip- 
soidal figures  of  uniform  density.  He  sought  ellipsoidal 
figures  satisfying  equations  (1)  and  (2)  (equation  (3)  gets 
decoupled  and  can  be  ignored  if  p is  constant  in  space 
and  time)  which  had  principal  axes  aligned  along  coordin- 
ate axes  in  a frame  of  reference  rotating  with  angular 
velocity  w;  relative  to  this  frame,  there  is  in  general  a 
velocity  field  whose  components  are  linear  functions  of 
position.  He  found  that  this  velocity  field  could  be  char- 
acterized by  a vector  X.  These  assumntions  lead  to  a sol- 
ution  of  equations  (1)  and  (2)  provided  the  time-dependent 
parameters  a^(k  = 1,2,3),  w,  and  \ satisfy  a certain 
nonlinear  system  of  ordinary  differential  equations. 

The  discussion  here  will  be  limited  to  those  Riemann 


ellipsoids  for  which 

0)  = 

(0,0,o))  and 

X = (0,0 

,A)  . 

The 

velocity  field  relative  to 

the  rotating 

frame  is 

given 

by 

al 

(30)  u.  = — Xx0  + 

al 

a2. 

u_  = - — Xx, 

a2 

+ X-  , 

u_  = 

a3 

— X-.  , 

1 a2  2 

al  1 

2 al  1 

a2  2 

3 

a3  3 

where  the  dot  renresents  time  derivative.  Referring  to 
equation  (1) , where  the  velocity  field  v is  relative  to 
inertial  space,  we  have 


(31) 

V = 0)  * X + u, 

•W  , V 

and 

, 2 
3 x. 

(32) 

P ( X , t ) = p (t)  (1  - Z — TT 
c k=l  a. 

The  differential  equations  are: 


(33) 

a‘i  = a^(XZ+u)  ) 

- 2a2Xo)  - 

2irGpa^A^  + 

2pc/pai ' 

(34) 

a2  = a2(X2+u)2) 

- 2a^Xu)  - 

2irGpa2A2  + 

2Pc/pa2 ' 

(35)  a’3  = -2irGpa3A2  + 2pc/pa3, 
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(36) 

a-^A  - a2o)  + 2a^A 

- 2a2o>  = 0, 

(37) 

a u>  - a2A  + 2a-^io 

- 2a2A  = 0. 

Equations  (36)  and  (37) 

can  be  integrated 

(36') 

( ai_a2 ) 2 (w+A) 

= ^ J - i C 2 2K. 
M it  1 

2 

5 1 

(37’) 

(al+a2^  (u-A) 

= g J + ; C E 2K, 
M it  2 

where  J is  the  angular  momentum  and  C the  equatorial 
circulation,  both  conserved  under  Riemann ' s assumptions. 
Equation  (2)  above  expressing  mass  conservation  is  replaced, 
under  the  present  assumptions,  by 


(38) 


3 irpa1a2a3 


M. 


Of  course,  if  the  fluid  is  incompressible,  then  p is  con- 
stant and  aga2a3  is  constant.  These  equations,  as  well 
as  various  modifications  of  them,  have  been  integrated  nu- 
merically [30 , 32]  . 

Consider  the  equilibrium  solutions  of  equations  (33)- 
(37).  One  finds  without  difficulty  that  thi y are  satisfied 
if  and  only  if  two  relations  of  the  following  form  hold: 


(39) 

(40) 


(a>+A) 2 
<w-A) 2 


a2  aT 

ffGpf  (■——»—■—)  , 
31  al 


a2  a3 
TTGpg(-^,-^ 

d ^ ^ ) t 


where  f and  g are  certain  combinations  of  a,  and  A, 

k k 

(k  = 1,2,3)  which  are  homogeneous  of  degree  zero  in 

al / &2'  a3*  One  can  assume  without  loss  of  generality  that 

a2  1 a] • 0ne  can  then  infer  that  a3  £ a1 , and  the  domain 

a2  a3 

(- — ) (t — ) -plane  where  equilibrium  solutions  exist  is  inside 
al  al 
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the  horn-shaped  region  of  the  Figure  . To  each  point  in 
this  domain  there  correspond  two  essentially  distinct 
Riemann  ellipsoids.  This  can  be  seen  easily  from  equations 
(39)  and  (40),  which  are  unchanged  if  w and  X are  in- 
terchanged. Two  solutions  of  the  equations  obtained  from 
one  another  in  this  way  are  called  "adjoint".  Along  the 
boundaries,  the  figures  are  self-adjoint.  This  notion  of 
adjointness  extends  to  the  time-dependent  equations  (33)- 
(37) , and  even  to  the  more  general  time-dependent  equations 
where  no  assumption  is  made  regarding  the  directions  of  o> 
and  X (cf.  [4] , where  a full  discussion  of  the  Riemann 
ellipsoids  is  given) . 

5.  The  Fission  Theory:  J = const.,  X = 0. 

The  influence  of  angular  momentum  on  a star's  evolution 
is  likely  to  be  particularly  pronounced  in  those  stages  of 
the  star's  existence  when  it  undergoes  a large  contraction. 
One  of  the  persistent  ideas  in  astronomy  is  that  a binary 
system  might  be  the  outcome  of  the  evolution  of  a rotating, 
self-gravitating  mass  undergoing  contraction.  Since  about 
half  the  stars  in  the  sky  are  binary  stars , and  the  mechan- 
ism whereby  they  are  formed  is  not  at  all  agreed  upon,  this 
idea,  that  a single  star  might  undergo  fission  to  create  a 
pair,  has  received  considerable  attention. 

The  notion  was  first  broached  in  1883  in  the  second 
edition  of  the  famous  physics  text  by  Thomson  and  Tait 
(35].  They  noted  that,  for  the  same  mass,  density,  and 
angular  momentum,  the  Jacobi  ellipsoid  has  lower  energy 
than  the  Maclaurin  spheroid.  They  reasoned  that,  in  the 
presence  of  viscosity  to  dissipate  the  energy  difference, 
the  Maclaurin  spheroids  in  question  should  be  unstable  to 
a disturbance  deforming  them  in  the  direction  of  the  Jacobi 
ellipsoids.  This  conjecture  has  only  been  verified  rather 
recently  [31].  They  further  noted  that  the  total  energy 
of  a binary  system  having  the  same  mass,  density,  and  angu- 
lar momentum  as  a sufficiently  elongated  Jacobi  ellipsoid, 
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The  boundaries  of  region  where  Riemann 
ellipsoids  exist  are  indicated  by  KpO 
and  K2=0.  The  Maclaurin  sequence  is 
at  the  right  ((^/Qi  =1 ).  The  Jacobi 
sequence  is  dashed  and  indicated  by 

x=o. 
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has  lower  energy.  They  concluded  that,  far  enough  along 
the  Jacobi  sequence,  that  figure  should  be  unstable  to  a 
deformation  tending  to  thin  it  in  the  middle  and  thicken 
i it  at  the  ends. 

This  leads  to  a picture  of  evolution  along  equilibrium 
configurations,  first  described  by  Poincare  [29].  In  the 
following  description,  it  is  necessary  to  note  that  vis- 
cosity plays  a manor  role,  whereas  the  Riemann  ellipsoids 
of  the  preceding  section  were  assumed  inviscid.  However, 
if  X = 0»  we  can  show  that  the  corresponding  figures, 
which  are  precisely  the  Maclaurin  and  Jacobi  figures,  con- 
tinue to  be  steady-state  solutions  even  when  viscosity  is 
taken  into  account.  Hence  one  is  restricted  to  the  family 
of  solutions  X = 0 by  the  assumption  that  viscosity  is 
important.  We  may  further  note  that,  while  the  angular 
momentum  integral  J survives  the  inclusion  of  viscous 
effects,  the  circulation  integral  C does  not.  Poincare's 
description  was  as  follows. 

Suppose  a Maclaurin  spheroid  of  small  eccentricity  con- 
tracts slowly,  i.e.,  its  density  increases  slowly,  its 
angular  momentum  being  fixed.  One  can  show  that  it  then 
. becomes  more  and  more  oblate  (just  as  if  its  density  were 

fixed  and  its  angular  momentum  increased) . It  then  moves 
down  the  Maclaurin  line  until  it  reaches  the  point  where 
the  Jacobi  sequence  bifurcates.  The  latter  being  the 
stable  branch,  it  now  leaves  the  Maclaurin  line,  and  fur- 
ther contraction  causes  it  to  move  along  the  Jacobi  se- 
quence in  the  direction  of  increasing  elongation,  until 
* that  sequence  becomes  unstable.  It  then  departs  from  the 

> Jacobi  shape  in  the  direction  of  a "pear-shaped"  figure, 

having  a constriction  in  the  middle.  The  pear-shaped 
figure  being  assumed  stable,  evolution  along  this  new 
family  of  equilibrium  figures  is  imagined  to  cause  the 
constricted  middle  part  to  grow  slowly  narrower  until  it 
finally  disappears,  leaving  a pair  of  masses  orbiting  one 
another. 


L 
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This  picture  was  based  partly  on  established  results, 
partly  on  conjecture.  In  his  memoir  of  1885,  Poincare 
established  many  of  the  ideas  and  much  of  the  terminology 
of  bifurcation  theory  in  the  context  of  systems  having 
finitely  many  degrees  of  freedom,  and  applied  them  to 
studying  bifurcation  along  the  Jacobi  sequence.  He  found 
that  it  becomes  unstable  to  a particular  disturbance  be- 
longing to  the  third  ellipsoidal  harmonics.  He  drew  a 
diagram  showing  what  the  perturbed  figure  would  look  like 
in  a first  approximation,  and  it  indeed  contains  a con- 
striction, and  looks  rather  like  a pear.  However,  the  dia- 
gram is  inaccurate:  careful  calculations  show  no  such 
constriction,  and  the  figure  looks  more  like  an  egg  than  a 
pear.  This  is  not  so  serious  in  the  theory  as  some  other 
defects,  however. 

Poincare's  analysis  was  linear  and  it  consequently  left 
open  the  question  whether  the  new  series  bifrucates  "for- 
ward" (i.'e.,  is  stable)  or  "backward"  (i.e.  , is  unstable). 
It  is  essential  for  the  picture  above  that  the  bifurcating 
figures  be  stable.  Laborious  computations  are  needed  for 
deciding  this.  They  were  carried  out  first  by  Sir  George 
Darwin  [9]  who  concluded  stability,  next  by  Lyapunov  [23] 
who  concluded  instability,  and  (presumably)  finally  by 
Jeans  [12]  , who  not  only  concluded  instability  in  agreement 
with  Lyapunov,  but  also  tracked  down  an  error  in  Darwin's 
work  which,  when  corrected,  brought  it  into  agreement  with 
the  others.  This  was  a heavy  blow  to  the  fission  theory, 
at  least  as  pictured  by  Poincare.  Others,  notably  Jeans 
himself  [12]  , have  argued  in  its  favor  despite  this  and 
other  setbacks,  but  they  have  not  been  very  convincing. 

6.  The  Evolution  of  Gravitationally  Radiating  Stars: 

C = const. , w = 0. 

There  are  two  stages  in  a star's  existence  when  it  can 
undergo  a large  contraction.  An  early  stage,  before  the 
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I 

I 

temperature  near  the  center  is  high  enough  to  cause  nuclear 
reactions  to  take  place,  and  a late  stage,  after  the  nu- 
clear fuel  has  been  consumed.  In  this  late  stage,  bizarre 
\ things  seem  to  happen.  If  the  mass  of  the  star  is  less 

i 

r than  a critical  mass  M^  (about  1.4m©,  where  Mo  is  the 

mass  of  the  sun) , it  collapses  to  a state  of  high  density 
known  as  a white  dwarf.  If  its  mass  exceeds  M^  it  cannot 
become  a white  dwarf  but  becomes  a neutron  star  at  still 
higher  density,  provided  its  mass  is  less  than  a second 
critical  mass  M2  (about  3Mo)  . If  its  mass  exceeds  M2 
it  collapses  out  of  sight  and  becomes  a black  hole.  At 
some  point,  at  about  the  density  of  a compact  neutron  star, 
the  gravitational  field  becomes  large  enough  for  effects 
, of  general  relativity  to  be  important. 

One  of  these  effects  is  gravitational  radiation.  This 
is  produced  by  the  rapid  motions  of  massive  bodies.  For 
example,  the  rotation  of  a superdense  star  about  an  axis 
with  respect  to  which  it  is  asymmetric  would  generate  such 
waves.  Astrophysicists  have  used  the  Riemann  ellipsoids 
in  an  attempt  to  follow  the  evolution  of  a contracting, 
gravitationally  radiating  mass,  in  the  following  way 
[11,  25]. 

The  Newtonian  equations  (1)  and  (2)  may  be  regarded  as 
the  first  terms  in  a formal  power  series  expansion  of  the 

I 

f general-relativistic  equations  (cf.  [5] ; the  parameter  of 

1 expansion  is  v/c,  where  v is  a characteristic  velocity 

and  c is  the  speed  of  light) . At  a certain  stage  in  this 
> expansion,  a term  appears  on  the  right-hand  side  of  (1) 

that  is  identified  as  the  radiation  reaction:  it  repre- 
k sents  the  effect  on  the  motions  of  the  radiation  of  gravi- 

tational energy.  To  find  the  effect  of  gravitational  ra- 
diation, one  neglects  all  relativistic  contributions  but 
this  one.  Hence  the  equations  are  as  before,  with  the 
exception  of  the  forcing  term  on  the  right-hand  side.  One 
can  ask  whether  ellipsoidal  solutions  of  the  kind  envisaged 
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by  Riemann  are  still  possible.  Now,  the  radiation-reaction 
term  is  the  gradient  of  a scalar  l/R,  where 


(41) 


VR-  - 


15C“ 


dt 


?)xjxk' 


and 


px . x,  dx 
: k 


x dx 


( j ,k  = 


1,2,3)  ; 


summation  on  j ,k  is  implied  in  equation  (41).  Since  l/R 
is  quadratic  in  the  coordinates,  one  indeed  finds  that 
ellipsoidal  solutions  are  possible.  Equations  (33) -(38) 
above  are  suitably  modified  [11] . The  circulation  integral 
C survives  the  introduction  of  radiation  reaction  because 
it  is  conservative,  but  angular  momentum  is  no  longer  a 
conserved  quantity.  What  about  steady-state  solutions? 

The  Maclaurin  spheroids,  being  symmetric  about  the  ro- 
tation axis , do  not  radiate  and  continue  to  be  steady- 
state  solutions  of  the  relativistically  modified  equations. 
Any  Asymmetric  ellipsoid  would  have  to  be  at  rest  in  iner- 
tial space  in  order  not  to  radiate,  i.e.,  in  order  to  be 
in  a steady-state.  Such  peculiar  objects  indeed  exist. 

This  can  be  seen  as  a consequence  of  the  existence  of  ad- 
joint configurations  (Section  4) . The  configuration  ad- 
joint to  a Jacobi  ellipsoid  (w  = wo,X  = 0)  is  a figure 
with  the  same  shape,  but  w = 0 , X = : its  surface 

appears  at  rest  in  inertial  space,  and  it  is  supported  by 
internal  motions.  Hence,  in  the  presence  of  gravitational 
radiation,  the  sequence  of  steady-state  figures  along  which 
evolution  may  proceed  is  characterized  by  C = const.  , 
w = 0,  in  remarkable  reciprocity  to  the  classical  fission 
theory. 

The  evolutionary  picture  is  now:  contraction  makes  the 
Maclaurin  spheroids  progressively  more  oblate;  at  the  same 
point  as  that  where  the  Jacobi  sequence  branches  off,  the 
Maclaurin  spheroids  become ‘unstable ; further  contraction 
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takes  place  along  the  adjoint  to  the  Jacobi  sequence 
(called  the  Dedekind  sequence) ; the  latter  also  becomes 
unstable  to  disturbances  described  by  third  ellipsoidal 
harmonics,  at  a different  point  along  the  sequence  from  the 
point  where  the  "pear-shaped"  figures  branch  off  from  the 
Jacobi  sequence.  Nothing  further  is  known  about  the  bifur- 
cating figures  at  this  point,  and  only  the  wildest  specu- 
lations can  be  made  regarding  the  creation  of  superdense 
binary  stars. 

7 . The  Fission  Theory  Reformulated:  J = const,  and 

C = const. 

In  each  of  the  preceding  sections,  dissipative  mechan- 
isms were  invoked  to  infer  that  the  Maclaurin  sequence  be- 
comes unstable  past  the  point  where  the  Jacobi-Dedekind 
sequence  intersects  it.  Neither  of  these  dissipative 
mechanisms  is  contemplated  in  the  equations  of  Section  4. 

If  they  are  indeed  absent,  then  J and  C are  both  con- 
stant (eqs.  (36')  and  (37') )•  The  constancy  of  J and  C 
delineates  a family  of  steady-state  figures  (just  as  the 
constancy  of  J and  the  vanishing  of  A , or  the  constancy 
of  C and  the  vanishing  of  co)  . In  particular,  a config- 
uration starting  on  the  Maclaurin  line  would  have  = 0 

initially,  hence  always.  The  unique  asymmetric  family  of 
ellipsoids  having  = 0 is  so  labeled  in  the  Figure.  It 

intersects  the  Maclaurin  sequence  at  the  point  where  the 
latter  becomes  unstable  in  the  absence  of  dissipation. 

This  point  where  instability  sets  in  was  discovered  by 
Riemann. 

A reformulation  of  the  fission  theory  along  the  lines 
described  above  has  been  initiated  in  [14].  Neglecting 
dissipation  seems  reasonable  in  view  of  estimates  as  to  its 
effectiveness  (cf.  [24]),  but  a potentially  more  important 
feature  has  been  incorporated  into  this  reformulation: 
compressibility.  In  the  theories  outlined  in  Sections  5 
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and  6,  it  is  assumed  that  the  density  increases  along  the 
sequences  considered,  but  the  stability  results  are  ob- 
tained under  the  assumption  of  incompressibility.  This  not 
only  seems  inconsistent,  but  also  suppresses  changes  in  the 
internal  energy,  which,  in  a star,  are  comparable  in  size 
with  the  changes  in  kinetic  and  gravitational  energies.  It 
turns  out  that  Riemann's  equations  (Section  4)  remain  val- 
id if  p is  a function  of  t (and  t only) . Then  one 
must  adjoin  the  energy  equation  (3)  and  an  equation  of 
state.  This  is  done  in  [14] , and  indeed  leads  to  evolu- 
tion along  the  branch  of  ellipsoids  marked  = 0.  These 

become  unstable  at  the  point  indicated  in  the  Figure  [15] . 
Whether  the  bifurcating  figures  are  stable  or  not  is  not 
yet  known,  but  the  stability  analysis  is  now  sufficiently 
different  (because  the  effect  of  compressibility  is  includ- 
ed) , that  one  may  seriously  entertain  the  possibility  that, 
in  contrast  to  the  classical  result,  they  are  stable. 

Whatever  the  outcome  of  the  fission  theory  may  turn  out 
to  be,  the  problems  outlined  in  this  and  the  preceding  sec- 
tions contribute  to  answering  the  fundamental  question 
expressed  in  the  last  sentence  of  the  Introduction. 

8.  Singularly  Perturbed  Initial-Value  Problems 

The  evolutionary  problems  discussed  here,  in  common 
with  a variety  of  analogous  problems  from  scientific  fields 
other  than  astrophysics,  have  a certain  paradoxical  ele- 
ment: on  the  one  hand,  the  figures  are  in  a steady  state 

whereas,  on  the  other  hand,  they  evolve.  Physically  there 
is  no  paradox.  One  notes  that  (at  least)  two  timescales 
are  relevant  to  the  problem,  a short  timescale  T^  and  a 
long  timescale  T2  (cf . , e.g.,  equations  (21)  and  (22) 
above) . During  evolution  on  the  long  timescale  the  system 
may  be,  to  a good  approximation,  in  a steady-state  with 
respect  to  motions  on  the  short  timescale.  To  the  extent 
that  there  is  a mathematical  paradox,  it  too  can  be  re- 
solved, as  follows. 
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Consider  a system  of  ordinary  differential  equations  of 
the  form  (x  and  y may  be  vectors) 

(42)  = f (x,y) , = g(x,y) , 

where  e is  a small  parameter  (the  ratio  of  the  timescales 

and  , say).  These  equations  are  vastly  simpler  than 
those  of  Section  2 governing  stellar  evolution,  but  certain 
analogies  can  be  made.  The  chemical  composition,  whose 
evolution  equation  was  not  written  down  in  Section  2 , is 
represented  by  the  vector  x,  and  the  remaining  variables 
(v,p,s)  by  y.  Then  (x,y)  indeed  satisfy  a system  of 
equations  analogous  to  (42),  provided  that  time  t is 
measured  in  the  unit  (i.e.,  on  the  long  or  evolution- 

ary, timescale).  The  "reduced"  system  of  equations  is  (for 
E = 0) 

(43)  = f <X'Y>  ' 0 = g(x,y). 

In  this  system,  the  second  equation  is  the  equation  for  the 
steady  state.  Hence,  if  for  small  e the  solutions  of 
(43)  approximate  the  solutions  of  (42)  accurately  enough, 
one  can  indeed  speak  of  evolution  along  steady-state . solu- 
tions. 

The  system  (42)  is  a "singular"  perturbation  of  (43) 
in  the  sense  that  it  can  accomodate  initial  data  for  x 
and  y,  whereas  only  intial  data  for  x can  be  specified 
in  connection  with  (43).  This  has  the  consequence  that 
solutions  of  (42)  can,  at  best,  approach  those  of  (43) 
non  uniformly  in  t as  e 0+.  Moreover,  solutions  of 
(42)  need  not  approach  those  of  (43)  at  all,  unless  a cer- 
tain condition  is  satisfied.  This  condition  may  be  stated 
in  the  following  way  [20]:  the  Jacobian  matrix  gy(x,y) 
should  be  stable  along  the  solution  of  (43) , i.e.,  all  its 
eigenvalues  should  have  negative  real  parts  there.  This 
condition  can  be  related  to  the  stability,  on  the  short 
timescale,  of  a steady-state  configuration  with  x viewed 
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as  a parameter  (exactly  as  one  does  on  physical  grounds) . 

If  the  reduced  path  satisfying  (43)  encounters  a point  of 
nonuniqueness  (a  bifurcation  point),  this  condition  fails, 
for  g^  must  be  singular  there.  For  the  most  familiar 
kinds  of  bifurcation  points  one  can  nevertheless  analyze 
the  relations  of  (42)  to  (43)  [16,  17,  21],  with  the  results 

that  the  solution  of  (42)  "usually"  stays  close  to  stable 
branches  of  the  steady-state  solutions  (although  this  can 
fail  and  the  solution  of  (42)  become  rapidly  variable  in 
time) . Note  that  the  nonuniqueness  associated  with  the  bi- 
furcation problem  and  hence  also  with  the  reduced  problem 

(43)  in  no  way  contradicts  the  usual  uniqueness  conclusion 
for  the  "real"  problem  (42). 

We  note  finally  that  the  standard  form  of  bifurcation 
problems  with  t as  a parameter  is  [8] 

(44)  G(t,y)  = 0. 

Equation  (43)  would  reduce  to  this  if  f were  independent 
of  y.  In  general  this  is  not  true,  however,  and  bifurca- 
tion in  evolving  systems  is  more  accurately  described  by 
(43)  . 
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Bifurcation  Phenomena  in  Chemically 
Reacting  Systems 
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ABSTRACT 

An  enormous  amount  of  theoretical,  computational,  and 
experimental  work  has  been  reported  showing  multiple  steady 
states,  pattern  formation,  and  oscillatory  phenomena  in 
chemically  reacting  systems.  Although  a number  of  the 
systems  studied  have  been  analyzed  through  the  application 
of  bifurcation  theory,  many  more  important  problems  remain 
virgin  and  untouched  by  this  powerful  tool  of  analysis. 

This  paper  discusses  a wide  range  of  problems  of  interest 
to  engineers  and  scientists  involving  heat  and/or  mass 
transfer  with  chemical  reaction  and  which  have  been  or  can 
be  profitably  treated  by  bifurcation  theory.  Case  studies 
are  presented  showing  some  of  the  results  which  may  be 
obtained  for  chemical  reactors  and  how  these  results  are 
useful  to  the  engineer. 

INTRODUCTION 

There  are  a wide  range  of  processes  involving  chemical 
reaction  which  have  been  observed  to  exhibit  bifurcation  to 
steady  or  time  periodic  states.  Although  the  first  experi- 
mental observations  of  these  phenomena  were  certainly  made 
by  prehistoric  cave  men  carrying  out  the  oxidation  of 
cellulose  (i.e.  wood  burning),  they  could  not  have  under- 
stood the  complexities  of  ignition  and  extinction  of  the 
flame  and  the  flame  oscillations  they  were  observing.  (cf 
Figure  1).  Some  of  the  earliest  scientific  reports  of 
multiple  steady  states  and  oscillatory  behavior  were  with 
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Figure  1.  Early  Observations  of  Bifurcation  Phenomena 
in  Chemically  Reacting  Systems. 


regard  to  catalytic  and  non-cataly tic  oxidation  and  oxida- 
tion-reduction reactions  [1-4]  . Since  this  early  work, 
similar  observations  have  been  reported  for  biological 
reactors,  polymerization  process,  homogeneous  and 
heterogeneous  tubular  reactors,  both  open  and  closed 
homogeneous  reaction  systems  with  complex  kinetics,  single 
catalyst  particles,  and  catalytic  wires  just  to  name  a few. 
Thus  the  scope  for  potential  applications  of  bifurcation 
theory  appears  to  be  very  large  indeed. 

In  this  paper,  the  goal  is  to  provide  an  overview  of 
the  many  chemically  reacting  systems  where  multiple  steady 
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states  and  periodic  phenomena  have  been  observed  and  to 
show  how  some  of  these  have  been  analyzed  through  bifurca- 
tion theory.  Therefore,  in  the  next  section,  a survey  is 
presented  which  is  designed  to  provide  the  reader  with  a 
guidebook  to  the  enormous  literature  in  the  field.  Follow- 
ing this  a case  study  shall  be  described  in  which  bifurca- 
tion theory  has  led  to  the  successful  and  comprehensive 
elucidation  of  the  structure  of  the  problem.  Finally,  a 
number  of  promising  problem  areas  needing  analysis  will  be 
described . 

LITERATURE  SURVEY 

It  would  be  an  overwhelming  task  to  survey  exhaus- 
tively the  extensive  and  wide  ranging  literature  on  bifurca- 
tion phenomena  in  chemically  reacting  systems.  Therefore, 
the  present  survey  will  be  selective,  emphasizing  those 
areas  of  greatest  interest  and  familiarity  to  the  author, 
and  shall  rely  on  existing  reviews  of  particular  subdomains 
of  the  field. 

To  begin,  we  shall  consider  branching  phenomena  which 
have  been  observed  experimentally  and  through  numerical 
simulation  and  without  considerations  of  analysis  by 
bifurcation  theory.  In  a very  interesting  review  of 
oscillating  reaction  systems,  Higgins  [5]  discusses  the 
types  of  oscillations  observed  and  the  particular  kinds  of 
kinetic  mechanisms  which  might  simulate  the  oscillations. 

His  main  emphasis  was  biological  systems.  More  recently, 
two  additional  reviews  over  the  dynamics  of  biological 
systems  [6,7]  also  provide  many  useful  references.  A 
rather  general  overview  of  chemical  oscillations  was 
contributed  by  Nicolis  and  Portnow  [8]>  however,  there  are 
fundamental  errors  in  their  discussion  of  thermochcmical 
oscillations  and  one  should  treat  this  review  with  caution. 
The  literature  on  the  dynamic  behavior  of  reactors  of 
interest  to  the  chemical  industry  has  been  reviewed  rather 
extensively.  Ray  [9],  Sheintuch  and  Schmitr  [10],  and 
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Hlavacek  and  Vortruba  [11]  discuss  the  behavior  of  catalytic 
reactors,  while  Schmitz  [12]  and  Gilles  [13]  review  the 
reported  results  for  both  homogeneous  and  heterogeneous 
reactors.  Recently  a great  deal  of  interest  has  been 
devoted  to  reactions  such  as  the  Belousov-Zhabotinsky 
reaction  which  show  both  temporal  and  spatial  periodicities 
even  in  a semi-closed  system.  These  systems  and  their  wave 
patterns  are  extensively  discussed  in  the  fine  review  by 
Marek  [14],  who  himself  has  reported  some  of  the  most 
interesting  comparisons  of  experiments  and  theory  with  this 
reaction  (e.g.,  [15-19]). 

In  addition  to  the  review  articles  noted  above,  there 
are  a number  of  monographs  which  are  devoted  to  these 
problems  (e.g.,  [20-27]).  Although  these  review  articles 

and  monographs  provide  an  entree  into  the  literature  in  each 
area,  it  is  instructive  to  consider  some  of  the  principal 
phenomena  observed  in  the  various  fields  and  what  efforts 
have  been  made  to  apply  bifurcation  theory  to  the  problem. 

Biological  and  Biochemical  Systems 

There  is  at  present  an  enormous  amount  of  interest 
among  mathematicians  as  well  as  natural  scientists  and 
engineers  in  the  oscillations  (e.g.,  biological  clocks), 
multiple  steady  states  (e.g.,  biological  switches), 
travelling  waves  and  pattern  formation  found  in  biologically 
reacting  systems.  It  is  beyond  the  scope  of  the  present 
effort  to  comprehensively  discuss  this  field.  However,  one 
can  note  that  major  efforts  are  being  made  both  in  biolog- 
ical living  systems  as  well  as  in  biochemical  reactors  for 
wastewater  treatment,  fermentation,  and  food  production 
(e.g.,  [6,7,22,25,27-66])  . The  major  thrust  at  present 

seems  to  be  to  determine  the  richness  of  bifurcation 
phenomena  possible  from  simple  reaction  networks  in  an 
effort  to  establish  the  simplest  possible  biological  models 
capable  of  predicting  observed  phenomena.  A number  of 
directions  seem  to  be  evolving  in  the  effort  to  solve  these 


BIFURCATION  IN  CHEMICALLY  REACTING  SYSTEMS 


289 


problems.  Thom's  Catastrophe  Theory  [28]  makes  use  of  a 
visual  topological  approach  in  attempting  to  explain 
morphogenesis  and  other  observed  bifurcation  phenomena. 
Pattern  formation,  waves  and  other  spatial  structures 
arising  from  partial  differential  equations  are  being 
analyzed  [67-78]  in  an  effort  to  understand  experimental 
results  arising  from  biological  systems  as  well  as  model 
reaction  networks.  The  goal  of  this  work  is  to  explain 
nerve  signal  transmission  and  other  biochemical  phenomena 
through  a study  cf  simple  chemical  model  systems  [79-110] . 
The  most  heavily  studied  model  system  is  the  Belousov- 
Zhabotinski  Reaction  [93]  involving  malonic  acid,  cerium 
ions,  and  bromate  ions.  This  system  is  an  interesting  one 
because  it  is  relatively  simple  and  yet  it  displays  both 
temporal  periodicities  as  well  as  interesting  dynamic 
pattern  formation.  The  literature  on  this  reaction  is 
astonishingly  large;  thus  only  a few  key  papers  describing 
observed  phenomena  [14-18,93-100]  will  be  noted  along  with 
those  efforts  to  apply  bifurcation  theory  to  the  system 
[14,19,78,101-110].  Two  particularly  good  discussions  of 
the  current  work  on  analyzing  this  network  may  be  found  in 
the  surveys  of  Marek  [14]  and  Othmer  [78]. 

Non-cataly tic  Chemical  Reactors 

There  are  many  types  of  chemical  reactors  which  do 
not  involve  the  action  of  a catalyst.  These  include  the 
homogeneous  continuous  stirred  tank  reactor  (CSTR) , 
heterogeneous  gas-liquid  and  1 iquid- 1 iquid  CSTR1 s,  as  well 
as  non-cataly tic  reactions  carried  out  in  tubes.  A special 
case  of  the  latter  class  of  reactors  are  those  in  which 
combustion  of  a hydrocarbon  takes  place.  It  is  of  interest 
to  those  working  in  bifurcation  theory  that  for  certain 
reactions  and  under  certain  conditions  all  of  the  reactor 
types  mentioned  have  been  observed  to  exhibit  multiple 
steady* states  and  oscillations.  Even  at  thermodynamic 
equilibrium  in  closed  syatems  there  appears  to  be  branching 
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to  multiple  steady  states  (e.g.,  [111,112]).  Excellent  ^ 

surveys  of  the  experimental  observations  and  their  models 

for  the  reactions  of  short  chain  molecules  may  be  found  in 

the  reviews  of  Schmitz  [12]  and  Gilles  [13] » thus  the  only 

augmentation  necessary  is  in  the  areas  of  polymerization 

and  combustion.  Both  homogeneous  and  heterogeneous 

polymerization  reactors  have  been  observed  (either  from 

experiment  or  through  analysis  of  mathematical  models)  to  J 

show  multiple  steady  states  and  oscillations  [113-124]  and 
in  some  cases  these  phenomena  cause  enormous  control 
problems  in  the  industrial  reactor.  Similarly,  multiple 

steady  states  and  oscillations  are  serious  problems  in  the  1 

design  of  combustion  reactors  (e.g.,  heating  devices,  ^ 

rocket  engines,  industrial  reactors  for  olefin  production,  I 

etc.)  and  there  is  a great  literature  (cf  [13,125-130]  for  j 

a brief  overview)  on  these  problems.  { 

Although  branching  to  multiple  steady  states  and  \ 

periodic  solutions  has  been  known  for  more  than  20  years  in 

i 

continuous  homogeneous  stirred  reactors  (e.g.,  [131-134]), 

the  structure  of  the  problem  was  greatly  clarified  by  the 
work  of  Poore  [135]  and  Uppal  et  al_.  [136,137]  who  applied 
Hopf-Fr iedr ichs  bifurcation  theory  to  the  problem.  There 
have  been  several  further  reported  applications  of  bifurca- 
tion theory  to  more  complex  reaction  systems  [122,139-141]. 

The  analysis  of  the  bifurcation  of  periodic  solutions  has 
been  made  a great  deal  easier  through  the  work  of  Poore 
[138]  who  has  reformulated  the  Hopf-Friedrichs  bifurcation 
theory  in  order  to  make  it  both  more  widely  applicable  and 
more  straightforward  to  apply. 

Catalytic  Reaction  Systems 

Catalytic  reaction  systems  most  often  involve  a 
catalytic  agent  on  a solid  support  and  housed  in  a stirred 
vessel  or  tube.  Thus  the  analysis  of  bifurcation  phenomena 
in  these  systems  must  be  made  on  two  levels:  (i)  consider- 

ation of  the  solid  catalyst  alone,  and  (ii)  examination  of 
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the  behavior  of  the  reactor  and  catalyst  material  together. 
This  separation  is  necessary  because  multiple  steady  states 
(both  uniform  and  nonuniform  in  space)  as  well  as  oscilla- 
tions and  waves  have  been  found  at  both  levels  of  analysis. 
As  noted  in  the  introduction,  there  are  a number  of 
excellent  surveys  of  the  experimental  phenomena  and  their 
analysis  for  this  class  of  problems  [9-13,26]  so  that  we 
shall  only  note  a few  key  papers  here  and  then  proceed  to 
a discussion  of  those  very  few  problems  having  already 
been  analyzed  by  bifurcation  theory. 

On  catalytic  surfaces,  experiments  show  that  multiple 
steady  states  and  various  types  of  oscillations  occur 
[9-13,26].  In  addition,  model  calculations  show  that 
standing  waves  and  asymmetric  steady  states  are  to  be 
expected  in  some  cases  (e.g.,  [ 26,142-147]).  However,  it 

appears  that  the  standing  waves  are  expected  to  be  unstable 
and  only  a limited  number  of  asymmetric  profiles  are 
possible  for  the  simple  kinetic  schemes  studied. 

Although  there  are  hundreds  of  published  studies  of 
the  branching  of  multiple  steady  states  on  catalytic 
surfaces  (cf  [9-13,26]),  there  appear  only  to  be  two 
papers  employing  bifurcation  theory  to  study  time  periodic 
solutions  [148,149].  The  first  of  these  [148]  uses  bifur- 
cation theory  to  show  that  simple  thermokinetic  oscilla- 
tions are  not  likely  to  arise  on  catalyst  surfaces.  The 
second  paper  [149]  demonstrates  a useful  lumping  procedure 
and  the  subsequent  application  of  Hopf-Friedr ichs  bifurca- 
tion theory  to  the  resulting  lumped  equations  for  the  case 
of  diffusion  and  reaction  inside  a porous  catalyst. 

Before  leaving  the  subject  of  catalyst  dynamics,  it 
is  useful  to  note  that  relaxation  oscillations  seem  to  be 
playing  an  important  role  in  the  observed  oscillatory 
behavior  of  many  reactions  [10,13].  Since  these  oscilla- 
tions involve  sudden  jumps  between  multiple  quasi-station- 
ary  states,  there  is  an  intimate  relationship  between 
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bifurcation  to  relaxation  oscillations  and  bifurcation  to 
multiple  steady  states.  A comprehensive  theoretical  treat 
ment  of  this  class  of  problems  would  be  very  useful. 

The  behavior  of  catalytic  well  mixed  reactors  may 
exhibit  multiplicities  and  oscillations  both  due  to  the 
reactor  itself  or  due  to  phenomena  on  the  catalyst  surface 
In  some  instances,  it  is  even  possible  for  the  reactor  to 
behave  in  a stable  fashion  when  the  catalyst  by  itself 
would  be  unstable  at  the  same  conditions  [150] . For  the 
tubular  catalytic  reactor,  the  range  of  possible  behavior 
is  even  more  interesting.  In  addition  to  the  existence 
of  multiple  steady  states  (as  many  as  three  stable  non- 
uniform  steady  states  have  been  observed  in  some  instances 
[151]),  one  sees  travelling  waves  of  various  types  [9,13]. 
Although  the  branching  to  multiple  steady  states  has  been 
studied  by  many  workers  [9-13]  , there  do  not  seem  to  be 
any  applications  of  bifurcation  theory  to  the  periodic 
and  wave  behavior  of  these  reactors.  It  appears  that  at 
least  some  of  the  travelling  waves  are  of  a relaxation 
type  controlled  by  the  local  heat  transfer  at  the  reaction 
front,  and  these  can  be  analyzed  through  a quasi-station- 
ary  analysis  [13,152]. 

AN  EXAMP LK  CASE  STUDY  - THE  CSTR 

In  this  section  we  shall  discuss  the  steady  state  and 
dynamic  behavior  of  the  Continuous  Stirred  Tank  Reactor 
(CSTR)  and  the  application  of  bifurcation  theory  to  the 
problem.  For  the  simplest  type  of  reaction  kinetics 
(first  order  irreversible),  the  mathematical  modelling 
equations  take  the  following  form 


(1) 


dx^ 

dt 
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where  x^  denotes  the  conversion  of  reactant  and  x2  a 
dimensionless  temperature  [136,137].  The  system  parameters 
are  B,  a dimensionless  heat  of  reaction;  y a dimension- 
less activation  energy;  3 a dimensionless  heat  transfer 
coefficient;  *2c  a d iroen s ion le s s coolant  temperature, 
and  Da  a dimensionless  ratio  of  reactor  residence  time 
to  reaction  time. 

This  system  of  equations  has  been  under  study  for  more 
than  20  years  (e.g.,  [131-134]),  but  the  structure  of 

the  periodic  behavior  was  not  clear  until  Hopf-Fr iedr ichs 
bifurcation  theory  was  applied  to  the  problem  [135-137]. 

In  this  section,  the  results  of  applying  this  theory  shall 
be  discussed. 

If  one  considers  the  steady  state  behavior  arising 
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it  is  easy  to  show  that  it  has  one  of  the  two  types  of 
behavior  shown  in  Figure  2.  Curve  (1)  corresponds  to  the 
case  where  branching  occurs  as  Da  varies  while  curve  (2) 
shows  the  behavior  in  the  absence  of  branching.  As  noted, 
the  branching  points  xlg  = m^fDa  ) , x2g  = in^  (Da2) 
correspond  to  points  where  the  determinant  of  the 
Jacobian  matrix 


(5) 


Hi 

3 x 


3'Xls'X2s 

vanishes.  It  is  possible  to  show  that  det  A > 0 unless 
the  condition 
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is  satisfied. 
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Figure  3.  CSTR  Behaviour  for  Different  Regions  in 
Parameter  Space 

function  of  3<Y/X2c^  for  which  Tr  A = 0 is  possible. 

Line  SM  is  the  locus  of  parameters  for  which  tr  A = det  A=  0. 

V -v 

Thus  lines  M,S,SM  define  six  regions  in  parameter  space 
which  have  qualitatively  different  dynamic  behavior  due  to 
the  placement  of  bifurcation  points.  Regions  III  and  V 
have  two  distinct  subregions  determined  by  the  direction 
of  bifurcation  of  the  limit  cycles.  The  dividing  lines 
<5 i = 0 , 62  = 0 were  determined  by  a perturbation  analysis 
from  the  bifurcation  points  [135,136,138].  Thus  for  this 
problem,  almost  the  entire  structure  may  be  determined 
purely  from  local  analysis.  Numerical  computations  were 
used  to  trace  the  periodic  orbits  and  this  led  us  to  the 
discovery  of  the  "evaporation"  of  limit  cycles  as  they 
encounter  the  separatrices  in  the  phase  space.  This 
coincides  with  the  period  of  the  oscillation  becoming 
infinite.  An  additional  bit  of  structure  uncovered  is 
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TYPICAL  PHASE  PLOTS 
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Figure  4.  Classes  of  Possible  Phase  Plots  for  the 
CSTR. 


that  there  appear  to  be  only  9 distinct  types  of  phase  plots 
possible  for  this  problem  (denoted  A,B--J  in  Figures  3 and 
4),  and  their  evolution,  as  Da  is  varied,  may  be  seen 
clearly . 

In  an  effort  to  aid  the  experimentalist  who  may  most 
easily  vary  the  reactor  residence  time,  T,  in  the 
laboratory  (rather  than  Da),  the  system  (1-2)  was 
reparameterized  [137]  by  letting 

Da  = Da  t 

(8) 

6 = 60t 


where  DaQ,  6^ 


become  reference  constants  (in  fact 


6 


o 


can 
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be  made  a unique  function  of  B)  so  that  the  parameters  of 
interest  are  B,  DaQ,  y , x2c'  T • To  our  surprise,  the 
structure  of  the  problem  was  greatly  complicated  by  this 
simple  transformation.  The  steady  state  bifurcation  was 
found  to  have  one  of  the  five  types  of  branching  shown 
around  the  perimeter  of  Figure  5 and  the  regions  in 
parameter  space  where  each  of  these  types  of  behavior  are 
found  is  shown  in  the  center  of  the  figure.  Of  particular 
interest  are  the  isolated  branches  (isola)  and  the  exist- 
ence of  as  many  as  4 branch  points  as  t varies. 

The  bifurcation  to  periodic  solutions  was  even  more 
complicated,  showing  enormously  fine  detail  of  structure 
[137].  A summary  of  the  types  of  steady  state  and 
periodic  solution  branching  is  shown  in  Figure  6.  There 


T — * 


Figure  5.  Multiple  Steady  State  Branching  as  Reactor 
Residence  Time  is  Varied. 
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Figure  6.  Summary  of  Steady  State  and  Periodic  Orbit 
Branching  for  the  CSTR  as  Residence  Time  is 
Varied . 

were  some  17  distinct  regions  in  parameter  space  where  this 
behavior  was  found  [137],  One  should  note  that  the  phase 
plots  shown  in  Figure  3 remain  the  same  for  the  reparam- 
eterized problem,  only  the  sequence  of  their  appearance 
changes  as  T varies. 

It  may  be  of  interest  to  note  how  the  branching  to 

multiple  solutions  found  in  this  problem  may  be  interpreted 

from  the  point  of  view  of  catastrophe  theory  [28].  Blum 

and  Mehra  [153]  have  parameterized  this  problem  essentially 

as  shown  in  Equations  (3,4)  and  find  only  the  familiar  cusp 

in  a space  of  x„  , Da,  8 (cf  Figure  7).  More  recently 
2 s 

Aris  [154]  has  analyzed  the  bifurcation  to  multiple  steady 
states  shown  in  Figure  5 and  seems  to  have  found  a very 
interesting  catastrophe  surface.  As  shown  in  Figure  8,  for 
b between  pt.  G s D (in  Figure  5),  one  obtains  a cusp  with 
an  attached  wing  in  an  xis'  Dag'  T sPace  while  for  B 
above  point  D,  one  seems  to  have  an  overhanging  wing  (cf 
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Figure  7.  Bifurcation  with  Respect  to  Da, 
Cusp  Catastrophe. 


a Simple 


Figure  9).  However,  it  appears  that  this 
only  a reparameterization  of  the  cusp  and 
sent  a new  catastrophe. 


surface  is 
does  not' repre- 


CONCLUDIHG  REMARKS 

It  should  be  clear  from  the  scope  of  the  literature 
cited  and  the  problems  discussed  that  a very  large  number 
of  chemically  reacting  systems  exhibit  multiple  steady 
states,  periodic  solutions,  wave  phenomena,  and  pattern 
formation.  In  addition,  many  of  these  reaction  systems 
have  great  scientific  and  practical  importance  and  are  of 
keen  interest  to  natural  scientists  and  engineers. 
Although  it  seems  clear  that  a proper  understanding  of 
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the  tools  arising  from  bifurcation  theory  is  the  key  to 
unravelling  the  structure  of  these  important  problems,  a 
major  responsibility  for  communicating  these  ideas  in  a 
understandable  form  to  the  user  lies  with  the  applied 
mathematics  community.  At  th®  same  time,  the  scientist 
and  engineer  must  provide  stimulation  to  the  development 
of  the  theory  through  reporting  observed  phenomena, 
developing  mathematical  models  as  candidates  for  analysis 
and  providing  a testing  ground  for  application  of  the 
theory.  It  is  only  through  this  type  of  mutual  communica- 
tion that  the  most  fruitful  development  and  application  of 
bifurcation  theory  can  occur. 

In  the  spirit  of  this  exchange,  the  present  discussion 
of  potential  applications  is  offered  along  with  some  naive 
suggestions  for  future  research  directions: 

(i)  The  development  of  some  easily  applied  conditions 
for  the  existence  of  isolated  steady  state 
branches  (isola)  [137]  and  of  isolated  periodic 
orbits . 

(ii)  Contributions  towards  a more  comprehensive 

distributed  parameter  bifurcation  theory  which 
allows  a straightforward  treatment  of  bifurcation 
from  nonuniform  steady  states  and  wave  motion, 

(iii)  The  development  of  a unified  theory  of  relaxation 
oscillations  and  the  relationship  to  steady  state 
branching  behavior. 

(iv)  Studies  of  conditions  for  synchronization  of 

oscillations  when  separate  oscillating  systems 
are  allowed  to  communicate  through  mass  or 
heat  exchange  (e.g.,  [14,16,17]). 
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Chemical  Wave-Trains  and  Related 
Structures 

L.  N.  Howard 


1.  The  kind  of  waves  with  which  this  lecture  is  concerned 
do  not  involve  any  macroscopic  mechanical  motion,  but  depend 
on  the  interaction  of  certain  chemical  reactions  with  dif- 
fusion. An  appropriate  mathematical  model  is  provided  by 
"reaction-diffusion  equations"  of  the  form: 

Cfc  = F(C)  + KV2C  . (1) 

Here  C is  a vector  whose  components  represent  the  concen- 
trations of  various  substances  which  are  participating  in 
certain  chemical  reactions.  Because  of  these  reactions, 
the  concentrations  are  changing  with  time — the  rate  at  which 
this  occurs  is  given  by  the  vector  F , which  is  a function 
of  the  concentrations  themselves,  and  possibly  of  certain 
other  things,  notably  temperature.  (If  temperature  varia- 
tions are  important  one  may  make  temperature  one  of  the 
components  of  C and  retain  the  form  (1).)  In  addition, 
if  the  concentrations  are  not  spatially  uniform,  diffusion 
will  also  contribute  to  the  local  rates  of  change;  this  is 
represented  by  the  last  term  of  (1) , in  which  K is  a posi- 
tive definite  matrix  of  dif fusivities . 

The  equations  of  "chemical  kinetics",  Cfc  = F(C)  , 
(which  might  be  obtained  explicitly,  for  a given  set  of 
reactions,  on  the  basis  of  the  law  of  mass  action  for  in- 
stance) would  normally  have  an  absolutely  stable  critical 
point  representing  the  final  equilibrium  state  after  all  the 
reactions  have  "run  down".  However,  in  some  cases--and  this 
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includes  those  of  interest  in  connection  with  chemical  waves 
— this  final  state  is  not  approached  for  a long  time,  and 
interesting  things  happen  on  shorter  time-scales.  It  is 
then  possible  (and  appropriate)  to  make  approximations 
which  produce  a simpler  set  of  equations,  of  the  same  gen- 
eral form  as  (1),  usually  for  a subset  of  the  original  set 
of  concentrations.  (The  simplest  example  of  such  approxi- 
mations, appropriate  for  instance  if  some  of  the  substances 
are  initially  distributed  fairly  homogeneously  and  in  con- 
centrations large  compared  to  the  amount  by  which  they 
change  over  the  short  time-scales  otherwise  of  interest,  is 
that  these  particular  concentrations  may  be  taken  as  approx- 
imately constant  in  the  equations  for  the  rates  of  change 
of  the  others.)  The  approximate  chemical  kinetics  equa- 
tions, even  though  they  describe  accurately  the  course  of 
the  reactions  on  the  short  time-scale,  need  not  then  have 
an  absolutely  stable  critical  point,  and  might  for  instance 
have  a stable  limit-cycle  solution.  (They  also  need  not 
particularly  resemble  the  polynomial  form  given  by  the  law 
of  mass  action.) 

A (plane)  wave-train  solution  of  Equation  (1)  is  a 
solution  of  the  form  C = y(ot  - az)  where  a and  a are 
constants,  z is  a spatial  variable  (normal  to  the  wave 
crests)  and  y is  a ?T7-periodic  function  of  its  argument. 

Such  solutions  to  (1)  exist  if  and  only  if  the  system  of 

> 

ordinary  differential  equations 

y'  = F (y ) + a2Ky"  (2) 

has,  for  suitable  constants  o (frequency)  and  a (wave 
number),  a 2ir-periodic  solution.  Note  that  the  order  of 
(2)  is  twice  that  of  the  (approximate)  equations  of  chemi- 
cal kinetics,  = F(C)  . In  general,  (2)  probably  has  no 

2 x-periodic  solutions,  no  matter  what  o and  a are.  But 
with  certain  hypotheses  about  F it  can  be  shown  that  it 
does,  and  in  fact  then  has  typically  a one-parameter  family 
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of  them:  for  each  a in  a certain  range  there  is  a a , 

2 

say  o = H(a  ) , such  that  (2)  has  a 2:r-periodic  solution 
ya  . This  was  shown  by  Kopell  and  me  in  [1] , under  the 
hypothesis  that  the  equations  = F (C)  of  the  kinetics 
have  an  unstable  critical  point  at  which  the  linearization 
has  two  conjugate  eigenvalues  in  the  right  half  plane.  It 
was  also  assumed  that  K was  a scalar  matrix,  or  nearly  so. 
The  proof  made  use  of  the  Hopf  bifurcation  theorem,  so  this 
result  should  perhaps  be  mentioned  at  a conference  on  ap- 
plications of  bifurcation  theory.  Unfortunately,  the  wave 
trains  given  by  this  turn  out  to  be  unstable  as  solutions 
of  (1),  so  probably  the  result  should  be  mentioned  only  at 
such  conferences.  However,  we  were  also  able  in  that  paper 
to  show  the  existence  of  a one  parameter  family  of  plane 
wave  solutions  for  all  sufficiently  small  a , under  the 
hypothesis  that  the  kinetics  equations  have  a stable  limit- 
cycle  solution.  It  appears  to  be  the  case  that  these  waves 
are  stable,  as  solutions  of  (1),  for  a certain  range 

0 < a < a of  wave  numbers  (at  least  if  the  diffusivity 
— — s 

matrix  is  sufficiently  close  to  being  scalar) . This  was 
not  obtained  by  a bifurcation  method,  but  by  exploiting  the 
singular  perturbation  character  of  (2)  for  small  a ; this 
also  provides  the  basis  of  an  effective  numerical  method 
for  calculating  such  solutions  of  (2) . Topological  methods 
have  also  been  used  to  show  the  existence  of  plane  wave 
solutions  in  some  circumstances.  Roughly  speaking,  it  seems 
that  wave-train  solutions  are  to  be  expected  when  the 
chemical  kinetics  is  "oscillatory",  as  in  the  cases  just 
mentioned,  and  also  in  some  other  cases,  often  described 
as  "excitable".  Here,  although  the  chemical  kinetics  equa- 
tions have  an  absolutely  stable  critical  point,  a suitable 
small  (but  finite)  perturbation  will  send  the  phase-point 
off  on  a long  excursion  before  it  once  again  approaches  the 
critical  point. 
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2.  Perhaps  the  simplest  examples  illustrating  wave  train 
solutions  to  reaction-diffusion  equations  are  the  "X  - w 
systems"  whose  kinetic  equations  are  of  the  form 


(3) 


where  X and  w are  functions  of  r » ^ci  + C2  • (Here 
and  C2  are  to  be  thought  of  as  deviations  of  the 
concentrations  from  a critical  point,  so  that  they  need  not 
be  positive.)  If  X (r)  has  a zero  at  some  radius,  say 
r = 1 , and  is  decreasing  there,  then  (3)  has  the  circle 
r * 1 as  the  orbit  of  a stable  limit  cycle,  assuming 
oj(l)  ? 0 . Such  systems  are  a favorite  example  for  il- 
lustrating limit  cycles  in  elementary  differential  equa- 
tions since  the  introduction  of  polar  coordinates  in  the 
phase  plane  makes  them  particularly  transparent.  Polar 
coordinates  (C^  + iC2  = re10)  are  also  helpful  for  reac- 
tion-diffusion equations  with  F(C)  given  by  the  right 
hand  side  of  (3) . Taking  K to  be  the  identity  matrix, 
one  finds,  in  the  case  of  a single  space  coordinate  z : 


rt  * r (X  (r)  - fl‘)  ♦ rzz 


°t  = U)(r)  + \ {t2qz)z 


(4) 


For  these  systems,  the  family  of  plane  waves  can  be  written 
down  explicitly,  and  is: 


0 * ot  - az 


where 

2 

u)(rQ)  , a * w(rQ) 


0 * 
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For  those  values  of  the  "amplitude  parameter"  rQ  for  which 

A > 0 , we  get  plane  wave  solutions;  for  instance  with 
2 2 

A=l-r  , u>  = 2 - r , there  are  plane  waves  for  all  a 
on  (0,1)  , the  wave  of  wave  number  a having  frequency 
o * 1 + a2  (S  H(a2))  . In  11],  the  (linear)  stability  of 

these  waves,  as  solutions  of  the  reaction  diffusion  equa- 
tions, was  investigated.  The  principal  conclusion  was  that 
they  are  not  always  stable,  but  those  close  enough  to  the 
stable  limit  cycle  of  the  kinetics  (i.e.,  those  with  small 
enough  a)  are  stable. 

3 . The  Belousov  reaction  is  probably  the  most 
thoroughly  studied  oscillatory  reaction,  though  there  is 
still  some  uncertainty  about  details  of  the  basic  chemical 
mechanisms.  Under  appropriate  conditions,  it  exhibits  self- 
excited  nearly  periodic  oscillations,  in  which  the  concen- 
trations of  some  of  the  reactants  change  by  several  orders 
of  magnitude,  with  a period  of  the  order  of  30  seconds  (the 
"short  time-scale”).  The  period  and  wave-forms  gradually 
change  with  time  as  the  major  ingredients  are  used  up,  but 
the  oscillation  persists  for  an  hour  or  two  (the  "long  time- 
scale").  For  references,  one  may  consult  [2].  Under  other 
conditions,  the  same  system  is  excitable,  but  not  self- 
excited.  It  is  also  with  this  system  that  real  chemical 
waves  have  been  most  clearly  demonstrated,  first  by  Zaiken 
and  Zhabotinskii  [3] . If  the  reacting  mixture  is  placed  in 
a shallow  layer  in  a covered  flat  dish,  one  sees  after  a 
while  an  organized  pattern  of  sets  of  concentric  rings — or 
sometimes  spirals--made  visible  by  color  changes  in  an  in- 
dicator. Figure  1 is  a photograph  of  such  a pattern. 

Careful  observation  shows  that  the  rings  are  propagating 
slowly  outward  from  the  centers  of  the  "targets".  Where 
adjacent  targets  meet,  the  meeting  waves  seem  simply  to 
annihilate  one  another,  indicating  their  highly  non-linear 
character.  The  spacing  of  the  rings  and  the  period  of 
oscillation  are  quite  uniform  over  a given  target,  but  are 
not  usually  the  same  in  different  targets,  nor  is  the 
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propagation  speed.  Besides  the  waves  themselves,  two  other 
striking  features  of  pictures  like  Figure  1 are  the  centers 
and  the  boundaries  between  different  targets,  the  rather 
thin  transition  zones  between  regions  of  essentially  uniform 
period  and  wave  length.  It  j.s  in  trying  to  make  a theoreti- 
cal model  of  such  transition  zones  that  Kopell  and  I have 
recently  found  some  useful  applications  of  bifurcation 
methods.  The  centers  appear  to  us  to  be  somewhat  different, 
and  other  methods  seem  more  appropriate.  I shall  now  des- 
cribe these  transition  zone  models  and  some  mathematical 
issues  connected  with  them.  A detailed  account  will  appear 
in  [4]. 

4.  The  observed  boundaries  separating  different  targets  are 
not  usually  fixed  in  position,  but  propagate  slowly  from  a 
target  with  higher  frequency  into  a neighboring  one  of  lower 
f requency--indeed , a target  will  sometimes  completely  de- 
vour its  neighbor  or  neighbors.  But  during  this  process, 
the  transition  zone  remains  thin  and  well-defined.  This 
suggests  that  an  appropriate  one-dimensional  idealization 
of  the  transition  zone  is  a solution  which  provides  a con- 
nection between  two  different  plane  waves  and  which,  in  a 
suitably  moving  coordinate  system,  retains  a permanent 
structure.  A plane  wave  is  a "permanent  structure"  in  that 
at  each  point,  the  concentrations  are  periodic  functions  of 
time,  and  in  fact,  in  a coordinate  system  moving  with  the 
wave  propagation  speed  o/a  it  appears  as  a steady  struc- 
ture. We  cannot  expect  to  find  a moving  system  in  which 
the  transition  zone  solution  appears  steady,  but  we  shall 
try  to  find  one  in  which  the  solution  is  everywhere  periodic 
with  the  same  period.  Thus,  we  shall  look  for  a solution  C 
to  the  reaction-diffusion  equations  expressible  as 
C = X(ot,z  - ct)  , where  X is  a 27r-periodic  function  of 
its  first  argument  with  the  asymptotic  behavior 
X(ot,z  - ct)  ^ y^  (o+t  - a+z)  for  z •*  + <»  . 


Setting 
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C = z - ct  for  the  position  coordinate  in  the  moving  system 
this  means  that 

X (ot, c)  ^ ya  ( (a+  - a+c) t - a+t) 

as  c . Thus,  for  the  above  to  be  possible,  we  must 

have  the  two  equations 

o = o+  - a+c  (5) 

from  which  the  propagation  speed  c of  the  transition  zone 
and  the  "Doppler  shifted"  common  frequency  a can  be  ob- 
tained in  terms  of  the  frequencies  and  wave  numbers  of  the 
two  asymptotic  plane  waves.  The  result  of  solving  for  c 
is : 


c = Aa/Act 


(6) 


the  finite-difference  analogue  of  the  group-speed  do/da 
of  the  family  of  plane  waves.  This  rule  (6)  for  the  speed 
of  a transition  zone  appears  to  be  well  verified  experi- 
mentally. It  was  noticed  already  by  Zaiken  and  Zhabotinskii . 

Let  us  consider  now  the  \ - go  case,  where  once  again 
the  polar  coordinates  in  the  concentration  space  prove  to 
be  helpful.  Transformed  to  the  moving  system,  the  equa- 
tions (4)  become: 


rfc  - cr^  = r (X  - 0(.)  + r?{ 


0fc  " C0c  = w + 4 (r20?)  c 


(7) 


It  is  readily  seen  that  these  equations  are  consistent  with 
a solution  of  the  form  r = r(^)  , 0 = at-  f ^ a ( c, ' ) d£ * , 

Jo 
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which  will  make  the  concentrations  27T-periodic  functions 
of  at  for  each  £ . in  this  way,  our  problem  is  trans- 
formed into  the  third  order  system  of  ordinary  differential 
equations  for  r , u = r'/r  , and  the  "local  wave-number" 
a : 


r 1 = ru 

2 2 

u ' = a _ u - X (r ) - cu 
a'  = w(r)  - a - 2au  - ca 


(8) 


This  system  has  critical  points  where  u = 0 , 

2 

a + ca  = cu(r)  and  a = X (r)  ; eliminating  r through 
2 - 1 2 

H(a  ) = w(X  (a  ) ) , this  gives  the  equation 

a + ca  * H(a2)  (9) 

which  has  the  solutions  a = a+  (cf.  (5)).  Thus,  the  two 
plane  waves — which  are  indeed  solutions  of  the  special  form 
under  consideration  which  also  have  r and  a constant — 
appear  as  critical  points  of  the  system  (8) . The  transi- 
tion zone  solution  which  we  are  seeking  is  then  a trajectory 
of  (8)  which  goes  from  the  critical  point  with  a = a to 
that  with  a * ct+  . Let  us  assume  now  that 

all'  (a2)  > 0 . (10) 

This  appears  to  be  the  case  for  the  real  waves  in  the  Belou- 
sov reaction,  where  shorter  waves  seem  to  have  higher  fre- 
quency. For  a X - cu  case,  it  corresponds  to  oj'(r)  < 0 
if  we  also  have  X ' (r)  < 0 , as  is  necessary  (over  the 
relevant  range  of  r)  for  stability  of  the  plane  waves.  A 
sketch  of  the  two  sides  of  equation  (9) , plotted  as  func- 
tions of  a , would  then  look  like  this: 
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ing  the  critical  points.  Instead  of  thinking  of  a+  as 
the  basic  parameters,  we  now  think  of  them  as  being  c and 
o , and  in  fact  consider  a family  of  problems  which  c 
fixed  but  a variable.  By  reducing  a from  the  value 
corresponding  to  the  original  a+  , we  slide  the  line  in 
Figure  2 downwards,  thereby  bringing  the  two  critical 
points  closer  together  until  they  finally  coalesce  (and 
disappear)  when  o = oq  and  the  line  becomes  tangent  to 
the  "parabola".  This  indicates  how  bifurcation  methods  can 
be  made  use  of  in  the  critical-point  connection  problem  to 
which  finding  the  transition  zone  solution  has  been  reduced. 
We  start  at  oq  , where  the  critical  points  just  come  into 
existence,  and  push  a back  up  in  the  expectation  that  we 
shall  at  least  be  able  to  say  something  for  cases  where  a+ 
and  a_  are  not  too  different. 

Critical-point  connection  problems  occur  in  the  problem 
of  finding  the  structure  of  a shock  layer  in  viscous,  heat- 
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conducting,  gas  dynamics.  There,  the  states  being  connected 
(at  least  in  the  idealized  case)  are  physically  constant  in 
time,  and  the  whole  shock-structure  is  too,  if  examined  in 
a coordinate  system  moving  with  the  shock  speed  determined 
by  the  Rankine-Hugoniot  conditions — of  which  Equation  (6) 
is  here  the  analogue.  For  the  chemical  waves,  the  states 
being  connected  are  not  really  constant  in  time,  but  perio- 
dic; they  have  been  made  to  look  like  constant  states  only 
after  some  mathematical  transformations,  notably  the  use  of 
the  polar  coordinates  in  concentration  space,  a device  which 
seems  restricted  to  the  very  special  (and  quite  non.-chemi- 
cal)  X - to  models.  Still,  the  analogy  seems  to  be  quite 
close,  and  we  have  come  to  think  of  the  transition  zones  as 
"shocks". 

5.  Here  is  a bifurcation  theorem  which  enables  one  to  show 
the  existence  of  a connecting  trajectory  for  Equations  (8) 
when  a+  and  a_  are  sufficiently  close  to  one  another — 
the  "weak  shock"  case: 

Let  x = (x)  = yg  + Ax  + Q(x,x)  + E.E.  be  a family 

of  autonomous  systems  of  dimension  n depending  on  the 
parameter  y , where  E.E.  represents  "everything  else" — 
terms  of  higher  order  in  x and  y than  those  written 
explicitly.  About  the  vector  g , matrix  A and  vector- 
valued quadratic  form  Q we  assume  the  following: 

i)  rank  A = n - 1 . Let  e and  i be  right  and  left 
eigenvectors  of  A corresponding  to  the  eigenvalue  0 , 
which  has  multiplicity  k . (e  and  J l are  unique  in  di- 
rection . ) 

ii)  Ig  ? 0 . 

(Under  these  hypotheses,  there  exists  a locally  unique 
curve  of  critical  points  in  (x,y)  space  passing  through 
the  origin.) 

iii ) HQ(e,e)  ? 0 . 

(With  this  additional  hypothesis,  there  are  exactly  two 
critical  points  for  all  small  positive  values  of 

A 

y = - yUg/HQ (e,e)  , and  no  others  near  the  origin.) 
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iv)  k = 1 or  3 . 

Then  there  is  a locally  unique  trajectory  joining  the 
critical  points  for  all  small  positive  values  of  u . 

(A  proof  of  this  and  of  related  results  for  k = 2 can 
be  found  in  [5 ] . ) 

If  one  investigates  the  bifurcation  point  o - oq  of 

Equations  (8)  one  finds  that  all  the  hypotheses  of  this 

theorem  are  satisfied,  with  k = 1 , and  so  we  can  conclude 

the  existence  of  the  idealized  transition  zone  structure  in 

the  weak-shock  case.  In  the  paper  [4],  the  existence  of 

connecting  trajectories  for  certain  strong  shock  cases  is 

also  demonstrated,  this  time  by  an  application  of  the  above 

theorem  in  the  k = 3 case.  The  connecting  trajectory 

given  by  the  bifurcation  theorem  goes  in  a definite  direc- 
2 

tion;  when  aH' (a  ) ^ 0 this  turns  out  to  mean  that  the 
transition  zones  exist  between  wave  trains  which  are  ap- 
proaching one  another  from  opposite  directions,  or  more 
generally,  when  the  one  on  the  left  has  a larger  group 
velocity  than  the  one  on  the  right.  In  the  opposite  case, 
the  shock  structure  does  not  exist  (as  it  does  not  for 
"expansion  shocks"  in  gas  dynamics) , at  least  between 
sufficiently  close  wave  trains. 

6.  The  success  of  the  bifurcation  method  in  the  A - u> 
case  led  us  to  attempt  to  treat  the  general  case  in  a simi- 
lar way.  Formally,  this  can  be  done,  but  in  order  to  make 
the  plane  waves  look  like  critical  points,  it  seems  to  be 
necessary  to  think  of  differential  equations  in  an  infinite 
dimensional  space  rather  than  just  the  3 of  (r,u,a).  To 
do  this,  we  consider  the  concentration  vector  in  the  moving 
system  X(t,c)  as  being,  for  each  £ , a point  in  a space 
of  vector-valued  periodic  functions  of  t . One  can  sepa- 
rate the  ideas  of  "amplitude"  and  "phase"  (paralleling  the 
r and  0 of  the  A - w case)  in  the  following  way. 

Since  we  are  going  to  use  a bifurcation  method  centered  on 
a certain  plane-wave  yQ  , we  may  suppose  that  all  the 
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orbits  (for  various  values  of  £)  of  the  periodic  functions 
X(«,C)  are  near  one  another.  We  can  then  give  a consis- 
tent rule  for  marking  on  any  such  orbit  a standard  point 
called  the  "phase-zero" — for  instance,  we  may  take  a hy- 
perplane in  concentration  space  transverse  to  these  orbits 
and  call  the  phase-zero  the  point  at  which  this  hyperplane 
is  crossed  from  one  side  to  the  other.  (In  [4]  a somewhat 
different  definition  was  found  technically  more  convenient, 
but  the  idea  is  similar.)  We  then  think  of  a space  Pfi  of 
"normalized  2Tr-periodic  functions"  as  vector  functions 

p(0)  whose  orbits  are  near  that  of  y and  which  have 

a 

o 

their  phase-zero  at  6=0.  We  then  try  to  solve  the 

shock-structure  problem  by  setting 
C 

X(t ,5)  = U (at  - J a(C')dc',C)  where,  for  each  ^ , U(0,£) 

0 

is  in  . We  may  assume  that  the  representation  of  the 

family  of  plane  wave  solutions  has  been  adjusted  so  that 

for  each  a , ya(0)  is  in  pn  * so  that  in  particular, 

the  limiting  plane  waves  y can  be  regarded  as  points  in 

a+ 

Pn  , with  constant  values  of  a . Inserting  the  above  form 
for  X(t,C)  into  the  reaction-diffusion  equations  in  the 
moving  system,  one  finds  that  the  result  can  be  put  into 
the  form  of  an  autonomous  system 


a^  = h(a,U,V) 


= G (a  , U , V) 


(11) 


where  U is  in  P , V is  in  a space  P which  is  like 
P^  but  with  a slightly  different  normalization  produced  by 
the  £-dif ferentiation,  and  a is  in  R , the  reals,  h is 
a mapping  from  (part  of)  RxPnxPQ  -*•  R and  G is  a similar 
one  into  P 

o 
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In  this  form,  the  problem  looks  formally  just  as  before — we 
are  seeking  a trajectory  in  RxPnxPQ  joining  two  critical 
points  of  the  system  (11)  . It  is  then  possible  to  identify 
the  analogues  of  the  vector  g , the  linear  operator  A 
and  the  quadratic  form  Q mentioned  in  the  bifurcation 
theorem.  Under  assumptions  which  are  natural  about  the 
plane  wave  solutions  (such  as  their  stability) , one  then 
finds  that  all  the  hypotheses  of  the  bifurcation  theorem 
for  k ■ 1 — apart  from  the  finite  dimensionality — come  out 
exactly  right.  It  thus  seems  a very  plausible  conjecture 
that  the  idealized  weak-shock  structures  exist  in  general. 

However,  there  seems  to  be  a serious  difficulty  in 
proving  this  along  the  lines  sketched  above.  There  are 
certainly  many  circumstances  in  which  it  makes  sense  to 
talk  about  differential  equations  in  Banach  spaces,  and 
there  is  probably  an  analogue  of  the  bifurcation  theorem 
stated  above  in  such  a context.  But  it  appears  that  the 
operators  h and  G in  (11)  , which  involve  second  deri- 
vatives with  respect  to  0 , are  too  singular  to  really 
define  a vector  field  on  RxP  xPq  . The  underlying  reason 
for  this  is  that  c is  really  a spatial  variable  in  the 
reaction-diffusion  equations,  but  in  forcing  the  equations 
for  the  shock-structure  into  a form  resembling  the  bifurca- 
tion theorem,  we  obtain  equations  of  evolution  in  £ . 
Everybody  knows  that  it  is  at  least  half  as  dangerous  to 
try  to  solve  the  heat  equation  sideways  as  it  is  to  do  it 
backwards!  Nevertheless,  we  know  that  connecting  trajec- 
tories exist  for  the  X - u>  case,  and  have  good  physical 
reasons  to  expect  that  they  do  for  general  chemical  kinetics 
which  have  plane  wave  solutions.  This  suggests  then,  that 
finite  dimensional  bifurcation  theorems,  like  the  one  stated 
above,  which  assert  the  existence  of  connecting  trajectori- 
es as  well  as  of  critical  points,  may  have  some  other  kind 
of  infinite  dimensional  generalization  which  does  not  fit 
into  the  context  of  differential  equations  on  a Banach 
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space.  We  suspect  that  Equations  (11)  do  make  sense  on 
some  very  restricted  part  of  the  domain,  a part  which  con- 
tains the  connecting  trajectory  of  interest  but  perhaps 
not  a whole  lot  else.  Possibly  if  these  things  were  under- 
stood in  a somewhat  mere  general  way,  we  should  find  a new 
kind  of  bifurcation  theorem. 
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A General  Class  of  Nonlinear  Bifurcation 
Problems  from  a Point  in  the  Essential 
Spectrum.  Application  to  Shock  Wave 
Solutions  of  Kinetic  Equations 

B.  Nicolaenko 


1 . Introduction  and  Background  from  Mechanics. 

We  investigate  an  abstract  class  of  bifurcation 
problems  from  the  essential  spectrum  of  the  associated 
Frechet  derivative.  This  class  is  a very  general  framework 
for  the  theory  of  one-dimensional,  steady  profile  traveling 
shock  wave  solutions  to  a wide  family  of  kinetic  integro- 
differential  equations  from  non-equilibrium  statistical 
mechanics  [1,2].  Such  integro-dif ferential  equations  usu- 
ally admit  the  Navier-Stokes  system  of  compressible  gas  dy- 
namics or  the  M.H.D.  systems  in  plasma  dynamics  as  a singu- 
lar limit  [3-5],  and  exhibit  similar  viscous  shock  layer 
solutions  [6,7] . 

The  mathematical  methods  associated  to  systems  of 
Partial  Differential  Equations  must  however  be  replaced  by 
the  following  considerably  more  complex  Bifurcation  Theory 
setting,  first  outlined  in  [8-10]  for  special  cases.  We 
actually  consider  a hierarchy  of  bifurcation  problems, 
starting  with  a simple  (solved)  bifurcation  problem  from  a 
simple  eigenvalue. 

Let  f)  be  a nonlinear  mapping  from  a Banach 

space  X,  into  a Banach  space  Y,  parametrized  by  p: 


(1.1) 

Consider 

(1.2) 

such  that 


Q 


(u,f) 


X X -*•  Y. 


Qw,  f>  = o, 

Qlv.O)  = 0,  V P G R1. 
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We  admit  all  the  necessary  hypotheses  to  insure  bifurcation 
at  u = u*,  from  a simple  isolated  eigenvalue  of  the  Frechet 
derivative 


Qt 


(u\0) . 


Classical 
borhood  of  ( p * , 0 ) 
w(p)  : 

(1.3) 


theory  [11]  insures  that,  in  some  neigh- 
in  X x,  there  exists  a second  branch 


(p , w(vi)  ) 
w(vi*)  = 


= 0 


0. 


Thus,  the  primary  hypothesis  is  bifurcation  from  a simple 
eigenvalue  for  the  operator  . In  concrete  cases,  the  rel- 
ative bifurcated  and  trivial  branches  correspond  to  differ- 
ent asymptotic  steady  states  at  the  "tails"  of  the  shock 
wave  (space-independent  subsonic  and  supersonic  states  re- 
lated by  Rankine-Hugoniot  conditions;  p = y*  corresponds  to 
the  transonic  regime) . 

We  shall  actually  investigate  the  more  involved  ex- 
tended operator  equation,  for  x e , - oo  < x < + oo  ; 


(1.4) 

(1.5) 


A ( P ) f)  - 0, 

^^(p,f)  = 0,  where 
A ( p ) ; R1  X X + Y 


or 


is  a linear  operator  from  X into  Y,  parametrized  by  p e R^ . 
f is  now  a vector  valued  function  of  x £ R1 , - °°  < x < + °° 
with  values  in  the  Banach  space  X.  We  may  restrict  our- 
selves to  spaces  of  absolutely  continuous  functions.  If 
A ( p ) = I and  x = t,  (1.5)  reduces  to  an  evolution  equation 

(1-6)  If  - - 0. 


and  one  looks  for  solutions  which  are  trajectories  between 
critical  points  of  (1.1),  i.e.,  the  trivial  solution  and 
the  bifurcated  solution  w(p).  Such  a problem  (1.6)  of  traj- 
ectories joining  two  steady  asymptotic  states,  has  first 
been  considered  by  B.  Matkowsky,  using  matched  asymptotic 
expansions  [12,13];  it  has  been  investigated  in  depth  by 
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G.  Iooss  [14]  and  K.  Kirchgassner  [15],  within  the  Navier- 
Stokes  context  (see  also  [16]). 

However,  in  (1.4-5),  fundamental  properties  of  the 
physical  context  impose  somewhat  pathological  conditions  on 
A(p)  : 

Hypothesis . 

1)  Zero  belongs  to  the  continuous  spectrum  of  A(p), 
V p , i . e . : 

R(A(p))  = Y and  A(p)f  = 0 * f = 0. 

2)  A(p)  is  neigher  positive  nor  negative  semi-definite,  nor 
is  it  accretive.  As  a corollary  A(p)  1 does  not  exist,  Vp. 
Recall  that  _ . 

#<-*•  °> 

does  not  exist  either  at  p = p*.  In  fact  the  properties  of 
A ( p)  are  such  that  an  initial  value  problem  for  (1.4)  is 
ill-posed.  Attempts  to  straightforwardly  extend  methods 
developed  for  (1.6)  lead  to  erroneous  results. 

We  still  look  for  critical  trajectories  of  (1.4), 
between  the  trivial  solution  and  w(p).  We  investigate  the 
possible  existence  of  a branch  fi(p,  x) , solution  of  (1.4) 
such  that: 

1)  ft ( p , x)  = o,  but  fi(p  , x)  ^ o,  p^p; 

2a)  fi(p,  •■»)  = 0,  fi(p,  +»)  = co  ( p ) ; or 

2b)  fi(p,  +<»>)  = 0,  Q(p,  -°o)  = cu  ( p ) , 

★ 

for  p close  to  p . In  an  appropriate  Banach  space  of  ab- 
solutely continuous  functions  normalized  at  ± °°,  the  hypo- 
thetical non-trivial  branch  ft(p,  x)  corresponds  to  bifur- 
cation from  the  essential  spectrum  of: 

d-7*  ^f(y,0)  = a ( p ) £ - ^yf(p,0)  . 

★ 

Specifically  at  p = p , zero  is  a limit  point  of  the  spec- 
trum (a  non-isolated  eigenvalue  in  the  essential  spectrum) . 

that  of  , 0)  . 

individual  essen- 
considered  as  a 
essential  spec- 


The  kernel  is  non-trivial,  as  it  includes 

The  non-isolated  character  stems  from  the 

tial  specra  of  A(p)  and—.  (1.4)  must  be 

0 ^ 

bona-fide  problem  of  bifurcation  from  the 
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trum.  Finally,  we  shall  demonstrate  the  non-trivial  result 
that  A(m,  +<*>)  = u)(y)  or  -°°)  = w(u)  (critical  trajec- 

tory) . Since  will  will  emphasize  the  mathematical  tech- 
niques, we  briefly  review  the  relevance  of  (1.4)  to  fluid 
and  statistical  mechanics. 

Steady  profile  shock  waves  in  compressible  fluid 
dynamics  and  magneto-hydrodynamics  correspond  to  rather  dif- 
ferent mathematical  theories  according  to  the  level  and  com- 
plexity of  the  fluid  dynamical  description.  In  order  of 
increasing  complexity,  one  has  the  well  known  hierarchy  of 
equations,  from  the  Euler  level,  to  the  compressible  Navier- 
Stokes  and  the  Magneto-hydrodynamic  (M.H.D.)  systems;  and 
finally  to  the  Boltzmann  equation  and  the  Kinetic  integro- 
differential  equations  of  collision-dominated  plasmas. 

While  viscosity  terms  are  explicit  in  macroscopic  Navier- 
Stokes  equations,  they  are  implicit  in  kinetic  equations, 
where  they  result  from  explicit  interparticle  collision 
description  on  a microscopic  scale.  H.  Grad  [3-51  has  care- 
fully investigated  the  singular  limit  of  the  Boltzmann 
equation  (for  neutral  gases)  to  the  Navier-Stokes  system 
when  the  mean  free  path  between  interparticle  collisions 
(microscopic  scale)  becomes  very  small  as  compared  to  the 
macroscopic  mean  flow  scale.  His  estimates  do  not  cover, 
however,  the  shock  case. 

Hyperbolic  systems  are  a standard  tool  for  discon- 
tinuous shock  solutions  of  Euler  equations.  Compressible 
Navier-Stokes  systems  exhibit  viscous  shock  layers:  in  one 
dimension,  Gilbard  and  Paolucci  reduced  them  to  a system  of 
nonlinear  autonomous  Ordinary  Differential  Equations  [6,7], 
and  demonstrated  that  the  shock  layer  is  the  unique  traj- 
ectory between  a node  and  a saddle  point.  For  M.H.D.  sys- 
tems, such  concepts  have  been  extended  by  Conley  and  Smoller 
[17],  using  advanced  tools  of  Topological  Dynamics  and 
Global  Analysis.  Yet  none  of  the  above  mathematical  methods 
apply  to  shock  solutions  of  microscopic  kinetic  equations. 
Worse,  it  is  well  known  that  Partial  Differential  Equations 
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approximations  of  the  "13  moments"  type  break  down  at  a 
finite  Mach  number  * 2 (non-existence  of  trajectories  be- 
tween critical  points)  [18].  The  major  problem  is  whether 
one  can  still  consider  the  latter  kinetic  equations  within 
the  framework  of  critical  orbits  between  critical  states. 

Moreover,  there  is  plenty  of  experimental  and  numer- 
ical evidence  for  important  microscopically  originating 
effects  observed  in  shock  layers  ruled  by  integrodif feren- 
tial  kinetic  equations.  Even  in  neutral  gases,  at  small 
Mach  numbers  of  1.2  (weak  shocks),  a 40%  deviation  has  been 
observed  for  the  local  ratio  of  the  heat  diffusion  to  the 
viscosity  coefficients  in  the  shock,  as  compared  to  predic- 
tions from  the  Navier-Stokes  equation  with  Transport  Coef- 
ficients calculated  by  the  time-honoured  Chapman-Enskog 
expansion  [19].  This  deviation  is  especially  marked  in  the 
"hot  (subsonic)  tail"  of  the  shock  [20].  Previous  numerical 
and  experimental  results  have  missed  these  important  dis- 
tortions by  focusing  only  on  the  geometry  of  the  sharp 
transition  profile  [21].  In  (collision-dominated)  Plasmas, 
;ll-scale  microinstabilities  are  as  important  as  large 
s:ale  (M.H.D.)  macroinstabilities,  and  account  for  the  dif- 
ficulties in  obtaining  stable  equilibria  configurations. 

The  simplest  kinetic  equation  - the  Boltzmann  oper- 
ator for  neutral  rarefied  gases,  has  been  worked  out  in 
previous  publications  [8-10],  and  is  summarized  in  the  ap- 
pendix. Technical  estimates  do,  however,  somewhat  dissim- 
ulate the  conceptual  simplicity  of  the  hierarchy  of  Bifur- 
cation Problems. 

Most  of  the  pathology  of  the  mathematical  problem 
stems  from  the  peculiar  properties  of  the  operator 

A(M)  & * 

This  operator  is  nevertheless  universally  present  in  kinetic 
(statistical  mechanics)  equations.  Usually  called  "the 
streaming  operator",  it  represents  transfer  of  very  high 
velocity  particles.  The  latter  account  for  all  deviations 
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observed  from  Navier-Stokes . Physically,  by  traveling  al- 
most instantaneously  in  opposite  upstream  and  downstream 
directions,  these  very  high  velocity  particles  cause  a 
strong  coupling  between  the  asymptotic  "tails"  of  the  shock. 


2 . The  Mathematical  Problem:  Principal  Results. 

We  first  consider  a classical  bifurcation  setting 
for  Problem  I: 

(2.1)  Q(V,  f)  = 0 

where  is  a bounded  nonlinear  mapping  from  a Banach  space 
X into  a Banach  space  Y (usually  graph-norm  spaces): 


Q(V,  f)  : R1  X X -►  y 
Q(V,  0)  5 0,  Vu  . 


(2.2) 

Cj  is  assumed  to  be  analytic,  both  in  f and  y : 
(2. 3. a)  QW,  f)  = £ ± 

with  the  notations: 

(2. 3.b) 


(0;  (f)“) 


(2. 3.c) 


/7{1)  (0;  f) 


27  (V 


(0;  f,  f)  = f (f,  f) 


(2. 3.d)  2,^7  Q(n)  (0:  (f,<n))  " h^'  f> 


Hypothesis  0. 

a)  Vu,  T is  a Fredholm  operator  of  index  zero. 

b)  at  u = y*,  dim  ker  { T * > = 1 (zero  is  an  isol- 
ated eigenvalue  of  T *). 

c)  t/f  ( 0 ; u * ) h 4 R (T  .),  Vh  e ker  {T^*}. 
Conclusion . In  some  neighborhood  of  (y*,  0)  in  R*  X x, 
there  exists  a second  branch  w(y) : 

(2.4)  (p , w(u))  * 0 and  w(y*)  11  0 . 

Hypothesis  O.d.  The  bifurcation  is  bilateral. 

We  now  investigate  Problem  II.  Let  x 6 R1  , - <*>  < 


x < + «>: 
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(2. 5. a) 

( 2 . 5 . b) 
(2.6) 


A(p)  |^-  - ^(p,f)  = 0,  equivalently 

£/?(p,f)  = 0,  where 

,1 


A ( p ) 


r x x -<•  y 


is  a bounded  (in  graph-norm)  linear  mapping  from  X into  Y. 
^ is  a nonlinear  mapping  acting  on  spaces  of  vector-valued 
absolutely  continuous  functions: 

(2.7)  ^ : R1  X ACI[R1  -*■  X]  -+  R1  X AC  ( R1  + Y]  . 

I 3 f 

The  space  AC  is  such  that  — e AC.  Generally,  the  AC  norms 

3X 

are  defined  by: 


fill  - f*~  llf£ll  ax 

J-  oo  3 X , Y 


3 f 


In  fact,  we  restrict  ourselves  to  spaces  such  that  — is 

3x 


continuous . 
alized : 


The  absolutely  continuous  functions  are  norm- 
f -*■  0 as  x -*•  -oo  or  x -*•  +°°  . 


Now,  Vp,  f = 0 is  still  a trivial  solution.  The  question 
is  whether  there  is  still  bifurcation  for  problem  II,  at 
p = p*,  such  that: 

ft(p,x)  e AC 1 [ R 1 X] 

fi(p*  x ) = 0 

fi(p,-°°)  = 0 or  f2(p,+°°)  = 0 . 

In  the  affirmative,  one  might  speculate  that 


fl  ( p , +°°)  = oj(v)  or  ft(p,-°°)  = u(p)  . 

This  corresponds  to  bifurcation  from  the  essential  spectrum 
of : 

(2.8)  ^?(p;0)  = A (p ) - T . 

r o x m 

Specifically,  at  p = p* , zero  is  a non-isolated  point  of 
the  spectrum,  with 
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To  insure  the  bifurcation,  and  as  suggested  by  statistical 
mechanics,  we  need  the  further 

Hypothesis  1.  Zero  belongs  to  the  continuous  spectrum  of 
the  linear  operator  A(p),  Vp: 

R(A(p))  = Y and  A(p)f  = 0 ° f = 0 . 

Corollary . (A(p))  1 is  unbounded  for  every  p . 

* - 1 

Remark  that,  for  p = p , T * does  not  exist  either. 
Hypothesis  Ibis.  A(p)  is  neither  positive  nor  negative  de- 
finite, nor  more  generally  accretive;  moreover  (X  A(p)  - 
T ) is  not  compact. 

Thus  one  cannot  construct  any  equivalent  norm. 

Hypothesis  2.  The  generalized  spectrum  of  the  operator 

X A ( p ) - T : X -*■  Y , X 6 C, 

is  included  in  two  sectors,  one  in  Re  X < 0,  the  other  in 
Re  X > 0 , uniformly  in  p.  (See  Figure  I) . The  generalized 
spectrum  [27]  is  the  set  of  X such  that  (X  A(p)  - T ) 1 does 
not  exist,  or  is  unbounded  as  a mapping  from  Y to  X.  Remark 
that  X ? Y,  and  A(p)  ^ I. 

From  classical  perturbation  and  invariance  proper- 
ties of  Fredholm  operators,  we  deduce  from  Hypotheses  0 and 
1; 

Theorem  2.1. 

a)  There  exists  a neighborhood  of  X = 0 in  C,  where 

X A ( p ) - T is  Fredholm  of  index  zero,  Vp. 

V*  * . 1 

b)  There  exists  a neighborhood  of  (p  ,0)  in  R X c, 

where  (X  A(p)  - T^)-1  has  a simple  pole  in  X,  corresponding 
to  a simple  generalized  eigenvalue  X (p)  : 

(2.9)  XQ(p)  A ( p ) JQ(p)  - Tp  5o(p)  = 0 , 

A 

where  o ( p ) £ X is  a generalized  eigenfunction.  | 

Hypothesis  3.  XQ(p)  is  real , and 

XQ ( p ) > 0 for  p > p*, 

XQ(p)  < 0 for  p < p*. 

This  last  hypothesis  implies  that  the  linearization  (2.8) 
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From  Hypotheses  0-3,  we  demonstrate  the  fundamental 
Theorem  2.2.  There  exists  a neighborhood  ^/)  of  (p*,0)  in 
R1  X AC  ( -*■  X],  where  there  exists  another  branch  solution 


ft(u,x)  , 

unique 

up 

to  a 

translation,  with: 

a)  p 

> P*  : 

ft  (p 

, -00) 

= 0 

ft  (p 

, +oo) 

= ^ (p) 

b)  p 

< p*  : 

ft(p 

, +°°) 

= 0 

ft  (p 

,-°°) 

= w (p)  , 

0 at  p = 

= P*. 

W(p) 

was 

defined  in  (2.4). 

and  ft(p  , x) 

Specifically,  ft(p,  x)  belongs  to  a closed  subspace  of 

1 a 

AC[R  X]  such  that,  denoting  by  C the  standard  space  of 

Holder  continuous  functions  of  index  a: 


G ca  [R1  X]  n l1  [R1  -*•  X] 

, 2 

A (p)ft  e Ca[R1  ->  Y]  n L1[R1  -*•  Y]  , 

3x 

(with  appropriate  asymptotic  decay  conditions  at  x = + °°, 
specified  in  later  sections).  Here,  0 < a < 1.1 

g 2 

As  a word  of  caution,  note  that  — ft  does  not  exist. 
-1 

Recall  that  (A(p))  does  not  exist,  Vp. 

The  pathology  introduced  bv  the  operator  A(p)  re- 
specifically  new  methods  for  the  bifurcation  Problem  II. 

The  general  line  is  to  attempt  to  rescue  the  time-honored 
Lyapunov-Schmidt  decomposition,  at  the  following  cost: 

1)  The  generalized  Lyapunov-Schmidt  decomposition 
requires  infinite  dimensional  projection  operators.  These 
are  constructed  with  the  help  of  a generalized  Operational 
Calculus,  characterized  by  non- commutativity  properties. 

2)  The  first  generalized  Lyapunov-Schmidt  equation 
is  closely  related  to  the  essential  spectrum  and  represents 
the  "fast  particles  contribution".  It  is  solved  with  the 
help  of  a generalized  operator  inverse;  the  latter  is  con- 
structed with  generalized  holomorphic  semi-groups  which  do 
not  admit  any  infinitesimal  generator. 

3)  The  second  generalized  Lyapunov-Schmidt  equation 


A GENERAL  CLASS  OF  NONLINEAR  BIFURCATION  PROBLEMS 


343 


t 


is  not  a mapping  on  finite-dimensional  spaces.  Rather  it 
is  a Functional-Differential  equation  in  the  sole  variable 
x 6 R^,  -"  < x < +°°,  and  global  (non-local)  in  character: 
the  initial  value  problem  is  ill-posed.  Moreover,  this 
equation  in  itself  is  again  a bifurcation  problem  from  a 
purely  continuous  spectrum. 

We  outline  the  mathematical  techniques  in  the  next 
sections.  Full  details  will  appear  in  [22]  and  elsewhere. 


A Generalized  Operational  Calculus,  and  the  Derivation 


of  the  Generalized  Lyapunov-Schmidt  Equations. 


(3.1) 


R(X,y)  = (X  A (y ) - T ) 


where  T is  defined  as  the  Frechet  derivative 
V 

f = 0 (2 . 3 . b)  . 


of  Qw, 


f ) at 


In  order  to  construct  appropriate  projections  assoc- 
iated to  the  isolated  pole  X (y) , one  cannot  use  the  class- 
ical operational  calculus  based  on  Dunford  Integrals  of 
R(X,y),  since  the  standard  resolvent  identity 


R ( X ) - R ( X ' ) = (X'  - X)  R ( X ) R ( X ' ) 


is  false  (non  commutativity  of  A(y)  and  T ) . It  must  be 
replaced  by  the  following  correct  identities: 


(3. 2. a)  A R ( X ) - A R(X')  = (X'  - X)  A R(X)  A R(X') 

( 3 . 2 . b)  R ( X ) A - R ( X ' ) A = (X'  - X)  R(X)A  R(X')A 

Based  on  (3.2),  a Generalized  Operational  Calculus  is  con- 
structed, characterized  by  anticommutativity  properties. 
Proposition  3.1.  There  exists  two  families  of  projection 
operators : 

E ( X ( y ) ) : Y -*•  Y (range) 

av  O 

E (X  (y))  : X -*•  X (domain) 

cip  O 

associated  to  the  generalized  eigenvalue  XQ(y),  such  that: 


E (X  ) A (y ) = A ( y ) E (X  ) 
av  o ap  o 

E ( X ) T = T E (X  ) = X E A(y)  , 
av  o y y ap  o o av 
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where  r 

Eav  (Xo(u))  = 2?T  / A(u)  dX 

c(Ao} 

Eap  (Vy))  = 2^1/  R(X'^  a<p)  dx 

c(A  ) 
o 

and  c(Aq)  are  closed  contours  around  Aq  in  the  complex  A- 
plane.  | 

(Note:  av  * avant,  ap  * aprfes) . 

Crandall  and  Rabinowitz  [23]  introduced  a somewhat 
related  notion  of  "K-eigenvalue" : A is  a K-eigenvalue  as- 
sociated to  (T  - AK)  if  1)  dim  N (T  - AK)  = codim  R (T  - AK) 
= 1,  and  2)  Kx0  R (T  - AK)  , where  N (T  - AK)  = span  xq  . 

This  notion  can  be  considerably  strengthened  by  the 


Theorem  3.2.  The  necessary  and  sufficient  condition  for  A 
to  be  "K-simple"  is  that  A is  an  isolated  eigenvalue  corres- 
ponding to  a simple  pole  of  (T-AK)-1.  The  generalized  oper- 
ational calculus  always  applies,  with  the  stronger  properties: 


a)  R(T  - A K)  = (I  - E ) Y 

av 

b)  span  Kx„  = E Y .1 

o av  1 

Remark:  Theorem  3.2  is  easily  generalized  to  the  case  of  a 

simple  pole,  but  with  dim  N > 1 (algebraically  K-simple 
eigenvalue) . 

Concerning  Problem  II,  we  have  the 
Corollary  3.3.  These  projections  E (A  (p)  ) and  E (A  (p)  ) 

cl  p O cl  V o 

respectively  reduce  the  domain  and  the  range  of  both 

3 

( AA ( p ) - T ) and  ( A ( p ) - T ),  where  the  latter  is  the 

M oX  p 

Frechet  derivative  of  at  f = 0,  defined  in  (2.8).  | 

We  have,  indeed  the  following  decomposition  diagram,  with 

the  major  problem  of  constructing  a pseudo-inverse  for 

{/It  ( p ; 0 ) from  (I  - E (p))  Y onto  (I  - E ( p ) ) X: 

/ r av  ap 


Y = 


(A(y)  Ti  ~ V 

(3.3)  X = 


(I 


(I  - E (p))  X 
ap 


A*V 

( 

, -i 

av  V 

1 

) ?? 

/ 

E (p)  X 
ap 


I 

I 
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Following  the  general  trend  of  the  Lyapunov-Schmidt 
method,  we  decompose  the  solution  f in  the  space  X as: 


(3. 4. a) 
( 3 . 4 . b) 
(3.4.C) 
( 3 . 4 . d) 


f = EAn  (p)  f + (1  - E (w)  ) f 

a dp 

= c(x)  (p)  + w (p,  x)  , where 
c(x)  i (p)  = E (p)  f, 

U dp 

w (p , x)  = (1  - E (p)  ) f , 
ap 


and  (p)  e X is  the  generalized  eigenfunction  associated 
to  A (p)  and  defined  in  (2.9).  In  fact: 

c ( x ) e A C1  [R1) 

W(p,  x)  e A C [R1  - (I  - E ) X] 

ap 

Applying  the  projection  (I  - E (u))  to  Problem  II  (2.5), 

a v 

using  the  anticommutation  properties  of  Proposition  (3.1) 
and  the  reduction  properties  of  Corollary  (3.3),  we  obtain 
the  following  generalized  Lyapunov-Schmidt  equations,  which 
we  shall  call  Ll  and  L2 : 


(3.5)  (Ll) 


A(„,  3=  - Tv  „ . II  - Eav  (u) 


/V  (p;  c (x)i fi  + w) 


where  J\j , defined  in  (2.3.d)  is  the  remainder  of  the  anal- 
ytic expansion  of  ^(p,f)  at  f = 0,  excluding  the  first 
order  Frechet  derivative  T(j,  but  including  the  second  order 
derivative  T (f,f). 

Should  the  major  problem  of  constructing  a pseudo- 
inverse for  the  left  hand  side  of  (3.5)  be  solved,  the  im- 
plicit function  theorem  in  Banach  Spaces  would  yield  w as  a 
functional  of  c(x): 

(3.6)  w { c ( x ) } : ACI(R1]  -*•  AC  [R1  ( I - E )X]  . 

ap 

To  obtain  the  second  Lyapunov-Schmidt  equation,  we 
apply  the  projection  E (p)  to  (2.5),  use  the  anticommut- 

3 V 

ation  properties  of  Proposition  (3.1)  and  assume  the  result 

(3.6) .  To  simplify  L2 , we  also  need 

Theorem  3.4.  For  a simple  isolated  eigenvalue  XQ(p),  there 


346 


B.  NICOLAENKO 


exists  n (p)  G Y*  and  t|>(p)  e X*,  such  that: 


a (u)  n(p)  = >Mp)  » 

E,_.  (y)  f = *<Mp)  ,f>/<lMP)  ,*>  (p)  >, 

dp  U O 

E (p)  f = A(y)  ^ (p)  <n  (p)  , f >/<<!>  (p)  (p)  > . 

av  O O 

With  the  previous  result,  L2  simplifies  as: 


(L2) 

(3. 7. a) 
where 


= *Q(P)  c(x)  - k(u)  (c  (x) ) 2 


+^{0  (x)  ,w} , 


(3. 7.b) 


<n(p),rM(*o,%)>y* 


k(p)  = 

k(p)  J-  0,  k (p)  = 0(1)  , 

(3. 8. a)  cJ^{c(x)  ,w)  = <r\(p),  /\j  ( P;  c ( x ) sP  o + w) 


- <n(p),r  (c  (x)^>0,c  (x)^Q)  >y* 


and  the  normalization  <iMp)  (p)  >v*  = 1*  (/V  and  r have 

O A 


A 


been  defined  in  (2.3.c-d)). 

A priori  j is  a mapping  on  c(x)  and  w: 

( 3 . 8 . b)  cJt-.  AC1  [R1]  X ac  [R1  -♦  X)  -*•  AC  [R1] 


but  with  the  implicit  functional  w{c(x)}  in  (3.6): 


J: 


AC1  [R1 ] 


AC  ( R ] , 


where  w{c}  depends  globally  upon  c(x),  < x < +°°.  So  in 

fact,  L2  (3.7)  is  a functional  differential  equation  for 

c(x),  global  in  nature.  The  initial  value  problem  is  a 

nonsense,  as  initial  data  ought  to  be  specified  Vx,  -<*>  < 

x < +<*>!  We  remark  that  the  differential  part  (including 

the  quadratic  term)  of  the  functional  differential  equation 

L2  is  in  fact  Landau's  Equation  (24).  The  exact  corrective 

term  to  Landau's  model  is,  interestingly  enough,  neither 

polynomial,  nor  differential,  but  a non-local  mappingc?^. 

• (? 

We^now  define  exactly  the  functional  subspaces  c ' x , 


and  & ap 


appropriate  for  the  investigation  of  L1-L2 . 
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(3.9) 

f G £ if: 


c ACtR1  * (l-Eap)Xl 
£„  c ACIR1  » (1-E  )Y] 

y dv 


a)  6 Ca  [R1  ->  (I-E  )X]  n L1[R1  (1-E  ) X ] 

dx  ap  ap 

2 

b)  Af  G Ca[R1  (I-E  )X]  n L1[R1  -*  (I-E  ) X] 

3x2  aP  ap 

2 

c)  exp(-2X  x)  and  exp(-2A  x)  -^-=-  Af  € L^tR1]  for  x<0 

o 9x  o 3x2 

(asymptotic  decay  at  x = -°°) 

2 

d)  exp((A  -e)x)-|^-  and  exp((A  -e)x)-^-~  Af  e L [R1]  for  x>0 

o dx  o 3x^ 


(asymptotic  decay  at  x = +<=°?  e > 0)  . 


f G £ if  the  conditions  a,  c,  d,  (excluding  b and  any 


conditions  for  — ~ Af)  are  satisfied,  with  X replaced  by  Y, 


l L1 [R1]  ar 

00  1 

G L [R  ] , f 


and  E 

ap 

0 < a < 

c (x)  G 

by  E . 

1 av 

1. 

<£c  if: 

a) 

P-  G C°[ 
dx 

b) 

exp(-A  > 
o 

c) 

exp( ( XQ- 

C (x)  G 

&C  if: 

a) 

c(x)  G , 

b) 

dc  e e 
a*  e 

£ and 
c 


4.  Methods  of  Solution  for  the  Lyapunov-Schmidt  and  the 


Functional  Differential  Eauations. 


Consider  the  first  Lyapunov  equation  (3.5)  as  a 
mapping  $ : 


(4.1. a) 


w)  : Sc  x c£x  » 

/7(0,0)  = 0, 


(4.  l.b) 
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/7(C,W)  = A(m)  g “ T^w 

(4.1.c)  -(I-E  ) Ativ,  c(x)  i + w)  = 0 . 

aV  U 

Theorem  4.1.  (0,0)  is  an  isomorphism  of  M onto  £ .1 

w . x (pi  Yp 

Corollary  4.2.  In  some  neighborhood  of  (0,0)  in  c X <=>x> 

there  exists  a unique  continuous  mapping: 

w{c  (w)  , v } ’.£  ■*■  <£x 

such  that: 

( c ( x ) , w { c ( x ) ,vi})  = 0.  | 

Theorem  4.1  hinges  upon  the  existence  of  0.0)"1;  let: 


(4.2) 


/^(O, 


0) f = A(w)  - T f = S(x) 


To  solve  for  f in  (4.2),  we  construct  generalized 
holomorphic  semi-groups.  The  mapping  /7W(0 ,0)  acts  from 
( I-E  )X  into  ( I-E  ) Y , cf.  the  reduction  diagram  (3.3).  In 

9 P 3 V 

particular  the  reduced  operator  XA(p)  - T^  is  invertible  in 

a neighborhood  of  XQ(p)  (deletion  of  the  eigenvalue  Xq(u)). 

The  essential  spectrum  (cf.  figure  I)  remains  only,  which 

allows  for  the  definition  of  Dunford  Path  Integrals  along 

it.  Let  r , T be  such  paths  along  respectively  the  left 

and  right  side  essential  spectra  (cf.  figure  I). 

Proposition  4.3.  If  S(x)  e £ in  (4.2),  then  the  solution 
— ?? — v 


of  (4.2)  is  given  by: 


(4.3) 


:/><*- 

*/V(X- 


y) [S(y)-S(x) ] dy 


(4.4) 


+ / U (x-y)  (S(y)-S(x)  ] dy  - T^  S(x), 

_ , +oo 

(T~  is  the  pseudo-inverse,  which  now  does  exist 
P * 

even  at  p = u ) ; 

U (x)  = e (XA(u)-T^)  dX; 


II  U-  ||  = O(y^y),  | x 1 - 0; 

although  U4  are  not  semi-groups,  A U and  U A are: 

(4.5)  A U+ (x+y)  = A U+ (x)  A u' (y) , 

for  x>  0,  y>  0orx<  0,  y < 0; 
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(4.6)  ’ 

3AU + 

c)X 

(4.7) 

au‘  a 

(4.8) 

||t  ir  || 

f = T U f , 


f = U T f ; but 
P 


and  the  holomorphic  semi-groups  (4.5)  have  no  infinitesimal 
generator . | 

A technical  hypothesis  needed  for  Proposition  (4.1), 
and  suggested  by  statistical  mechanics  is: 

Hypothesis  4.  Let  R(A,p)  = (AA(p)  - T ) ^ ; then  as  |A|  -*■  °°, 


within  the  resolvent  set: 
(4. a) 


P 


R(A,P)  ||  = 0(-l— ) , 0 < a < 1 

I X I 


or : 


(4  . b)  ||  R(A,p)  ||  = 0(1)  . 

Proof  of  Proposition  (4.1)  is  more  complicated  with 
hypothesis  4.b.  Specifically,  A U(0)  exist  in  the  case 
of  hypothesis  4. a (although  limits  are  projections,  but  not 
the  identity!),  but  are  undefined  under  hypothesis  4.b. 

We  now  investigate  the  functional  differential  eq- 
uation L2  (3.7-8): 

L2  = S - i 
c c 

(4.9)  dc  \ / \ , . , , . , , . . 2 

= XQ(p)  c(x)  - k (p)  ( c ( x ) ) 

+ p , c , w f c } ) . 

Vp , c = 0 is  a trivial  solution  (as  J is  multilinear  in  c 
and  wfc}).  (4.9)  is  again  a full-sized  bifurcation  problem 
from  a continuous  spectrum,  at  p = p ; 

Xo (p* ) = 0,  k(p*)  = 0(1) , p = p*. 

★ dc 

At  p = p , the  Frethet  derivative  of  (4.9)  reduces  to 

The  latter's  spectrum,  in  spaces  of  absolutely  continuous 

functions,  is  a purely  continuous  spectrum  containing  the 

full  left  or  right  half  complex  plane,  including  the 
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imaginary  axis  (depending  on  normalization  of  the  AC  spaces). 

New  techniques  are  needed  for  (4.9).  We  first  make 
the  following  remarks;  in  a neighborhood  of  p = p*: 

XC(P)  = 0(p-p*)  , 

II  c (x)  II  = O ( p-p * ) , 

(4.10)  f = c(x)  ^Q(p)  + 0(p-p*)2 

A (p)  exp (Ax)  ? 

MpT  exp  ( Aqx)  +1  ^o(M)  + ) ; 

the  lowest  order  Landau  differential  operator  approximation 
(4.10)  is  accurate  only  to  0(p-p*).  The  exact  p,c,w) 
contribution  appears  at  0(p-p*)2  and  corresponds  to  devia- 
tions from  the  "Navier-Stokes"  solution  (so  called  since 
the  Landau  equation  (4.9)  without  the  functional  J admits 
the  universal  hyperbolic  tangent  Taylor  weak  shock  profile 
for  one-dimensional  Navier-Stokes  systems) . 

The  key  concept  is  to  consider  (4.9)  not  as  a bi- 
furcation from  c(x)  = 0,  but  as  a branching  from  the  Landau- 
Taylor  profile 

Aq(p)  exp ( Aqx) 

^ k(p)  l+exp(A  x)  V'0 
o 

introduce  a change  of  function,  a change  of 
a change  of  parameter  in  (4.9): 

t = Ao(p)  *»  p = p(x), 

y = ix  = A x , 
o 

^ (lt("yn  ' 

functional-differential  equation  for  0 (y) , on 
parametrized  by  r: 

+ -f^-0  = - 02+  t k(r)  (l+e_y)  Jl  {0}, 

ey+l  e^  + 1 

( 4 . 13 . b)  0=0  at  t = 0 ; 

let  L3  be  the  operator  defined  by  (4.13),  then: 

(4.13.C)  L3  : e^,1  Sc  ! 


(4.11) 

To  do  so,  we 
variable  and 

(4. 12. a) 

(4 . 12 . b) 

(4.12.C) 


L2  becomes  a 

-oo  < y < +0°, 

, . . 0 . d0 

(4. 13. a)  ^ 
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c? T(0(y))  is  identical  to  c(x))  (3.8),  after  substitu- 

tion of  ( 4 . 12 . a-b-c) . 

Whereas  we  look  for  0 -*•  0 as  t -*  0+  (branching  from 
Landau’s  solution  (4.11)),  the  former  trivial  branch  c (x) iO 
now  becomes  0(y)=-l,  Vt . We  have  effectively  achieved  sep- 
aration of  branches.  This  is  confirmed  by: 

Theorem  4.4.  Let  be  the  Frebhet  derivative  of  L3  at 

0 = 0: 

(4.14)  -/(0)  = ^ + -i—  0 

I'  1 


ey+i 


then  _Z'  is  a bounded  mapping  from  ^ 
Remark:  ^ 


Vt  > 0.  | 

is  an  integral  operator  on  -<»  < y < +<*>,  which 


onto  £ 1 
c c 


is  in  general  unbounded  on  spaces  of  integrable  functions. 

This  required  a much  more  complicated  theory  in  [10).  If 

we  do  take  into  account  the  asymptotic  decay  conditions 

included  in  i.  £Z  (3.9): 
c c 

, . d0  . , . d20  ^ r°°/ril. 

exp(-y)  and  exp(-y)  — y e l (R  ) 

ay  dyz 


for  y < 0,  and 

2 

exp((l-e)y)  ^ and  exp((l-e)y)  — ^ e l'^R1] 
y dy 

for  y > 0,  e > 0,  then  Jl  ^ is  bounded  from  onto  c>c^  • 
These  decay  conditions  are,  of  course,  suggested  by  the 
behavior  of  the  derivatives  of  Landau's  solution  (4.11)  at 
y = +«>.  To  conclude: 

Corollary  4.5.  In  some  neighborhood  of  x = 0 in  Rl,  there 
exists  a unique  mapping 


t -*•  0 (t  } 

R1  - £cl  , 

such  that  0{t}  is  the  unique  solution  of  (4.13)  with 
0(0}  = 0. | 


To  demonstrate  the 
function  theorem  applied  to 
ping  from:  R^  X 


corollary,  we  use  the 
(4. 13. a)  considered 


impl icit 
as  a map- 


352 


B.  NICOLAENKO 


Finally  from  0{t},  we  reconstruct 
<■«*>  - <™<'» 


f = c(t)  ^q(t)  + W { C ( T ) } . 


The  solution  is  actually  unique  up  to  a translation,  since 
we  have  chosen  an  arbitrary  (normalized)  origin  is  obtain- 
ing the  Landau  profile  (4.11)  solution  of: 

(4.15)  ||  = Ao(w)  c(x)  - k (p)  (c  (x) ) 2 . 

The  asymptotic  behavior  of  f at  x = ±°°  shows  that 
deviations  from  the  "Navier-Stokes"  component  c(x),  caused 
by  w{c},  appear  0(t  ) in  the  "hot  tail"  of  the  shock. 

Roughly  speaking,  c (y)  decays  0(exp(y))  as  y -*■  -<*>,  whereas 
w(y)  decays  0(exp(2y)).  As  y -*■  +00,  both  c (y)  and  w(y)  decay 
0 (exp- (l-e)y) . 

To  conclude,  we  remark  that  the  concept  of  modified 
Landau's  equation  has  also  been  introduced  by  N.  N.  Janenko 
[25,26]:  he  has  added  higher  order  polynomial  terms  in  c(x) 
to  (4.15),  in  order  to  study  the  transition  to  turbulence 
in  incompressible  Navier-Stokes  flows.  Here,  at  the  kinetic 
level,  we  have  a corrective  global  functional  operator  Jl. 

A natural  extension  of  Problems  I-II  is: 

(4.16)  ||  + A ( p ) ||  - £/(p,f)  = 0 ; 
in  this  respect,  we  have  the 

Conjecture . For  p < p*,  t < 0,  the  second  branch  fi(p,x)  is 
unstable  in  time;  it  is  stable  for  p > p*,  t >0.  (This 
corresponds  to  well-known  Entropy  Conditions  across  the 
shock  for  Navier-Stokes) . Also  more  general  wave  solutions 
of  (4.16)  may  be  investigated,  including  Burgers-like  waves. 
Work  is  in  progress  on  these  questions. 

APPENDIX 

We  summarize  technical  results  of  [8-10].  The 
Boltzmann  equation  [27]  rules  the  evolution  of  a local  vel- 
ocity particle  distribution  F ( <5 ) , with  the  local  velocity 


A GENERAL  CLASS  OF  NONLINEAR  BIFURCATION  PROBLEMS 


353 


vector 

(A . 1 ) c = (c1,  c2,  c3);  c = |c|  . 

The  space- independent  Boltzmann  operator: 

(A. 2)  Q [F,  F]  = 0 

2 3 

is  a bilinear  integral  operator  in  L (R  ) . It  acts  only 
upon  the  velocity  vector  c.  Classically: 

(A. 3)  Q[F,  F]  = 0 ~ F = w(p,  c)  , 

where  i.j  is  a maxwellian  (gaussian)  distribution: 

01  (p'  c)  = (2rrRT?3/2  exp 

where  p is  the  density,  p the  mean  velocity  (directed  along 
the  x-axis)  and  T the  temperature.  These  macroscopic  quan- 
tities which  appear  in  the  Navier-Stokes  equations,  are 
simply  related  to  weighted  averages  of  F(c,  x) : 

P = J"  F (c , x) dc , PM  - f c\  F<c,x)dc  , 

3 p R T = J"  (c-p)2  F (c , x) dc  , 

where  R is  the  perfect  gas  constant. 

In  one  space  dimension,  the  space  dependent  Boltz- 
mann equation  for  the  velocity  distribution 

F (c,  , c,  x,  t)  , c = | o | , 

(A. 4)  1 

X 6 R , -<*>  < x < +“>  , 

becomes : 

(A. 5)  If  + ’ 0tp'F) 

The  second  term  on  the  left  side  is  the  one  dimensional 

version  of  the  ubiquitous  "streaming  operator" 

c.  V F . 
x 

We  look  for  traveling  waves  of  the  type 
Ffc^  c,  x + pt)  . 


2 2 2 
(c^p)  +c2  +C-, 

2RT 
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A viscous  shock  is  defined  as  a nonlinear  trans- 
ition profile  between  two  asymptotic  (x  = + »)  Maxwellians; 
one  with  mean  velocity  p subsonic;  the  other  with  p super- 
sonic. It  must  be  noted  that  the  same  Rankine-Hugoniot 
conditions  as  for  Navier-Stokes  uniquely  relate  p+,  p+,  T+ 
and  p , p , T . After  renormalization  [9,10]: 

(A. 6)  (p+c.)  ■ = L f + T [f,f]  , 

H 1 3x  p p 

where  is  the  Frechet  derivative  of  Q,  and  r^,  an  approp- 
riate second  order  derivative;  together  with  the  normal- 
ization : 

(A. 7)  f(c1,  c,  -oo ) = 0 or  f(c^,  c,  +oo ) = 0 . 

(A. 6)  is  investigated  in  a space  AC  of  absolutely  contin- 
uous functions: 

AC  [R1  -*•  X]  -*■  L£oc[R1  ■+  Y] 

(normalized  at  ±°°,  cf.  (A.  7))  and  X,Y  are  appropriate  graph- 
norm  Banach  spaces  defined  uniquely  on  the  velocity  variable. 
The  following  is  then  demonstrated: 

Proposition  A.l.  In  appropriate  spaces  X,Y  (implicitly 
incorporating  the  Rankine-Hugoniot  conditions) , 

(A. 2bis)  Q [ f , f ] = f + r [f,f] 

is  a bifurcation  problem  from  a simple  isolated  eigenvalue 
of  L * at  the  critical  sonic  value  of  p = p*.  The  two 

— P 

branches  correspond  to  a subsonic  and  a supersonic  Maxwell- 
ian, identical  at  p = p * . | 

Looking  for  a critical  trajectory  joining  the  two 
asymptotic  bifurcated  subsonic  and  supersonic  maxwellians, 
we  consider  (A. 6)  as  a bifurcation  problem  from  the  essen- 
tial spectrum,  superimposed  upon  the  simple  bifurcation 
problem  (A.2bis).  In  (A. 6),  f = 0 is  indeed  a trivial  sol- 
ution Vp.  The  essential  spectrum  is  evident  from  the 
identification : 


(A. 8) 


A ( p ) 


(p+Cj^)  I , 
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• 2 3 

which  does  not  possess  an  inverse  in  L (R  ) , since 

— oo  < < +°° 

(cf . c^  = -p)  . The  hypothesis  required  by  the  abstract 
setting  are  completed  through  the: 

Proposition  A. 2.  The  generalized  eigenvalue  problem 

Afp+c.^  - L^<p  = 0 

has~~a'-r£a  1 , simple,  isolated  eigenvalue  AQ(p),  in  the 
spaces  X and  Y r — . __ 

Aq  ( p TU^p  < P* 

AQ(u)  > 0,  p > P*.  ' 

Similar  results  were  obtained  by  H.  Weyl  in  1949 
[28],  for  the  Navier-Stokes  equations  linearized  about  sub- 
or  supersonic  equilibria.  Finally,  the  "streaming  operator" 
A ( y ) defined  in  (A. 8),  though  responsible  for  the  pathology 
of  the  problem,  is  universally  present  in  kinetic  (stat- 
istical mechanics)  equations.  It  represents  transfer  of 
very  high  velocity  particles,  and  generates  the  essential 
spectrum  of  kinetic  operators. 
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Numerical  Solution  of  Bifurcation  and 
Nonlinear  Eigenvalue  Problems 

Herbert  B.  Keller 


1.  Introduction. 

We  show  in  this  paper  how  a large  class  of  bifurcation  and  non- 
linear eigenvalue  problems  can  be  solved  and  the  numerical  methods 
justified.  Only  equilibrium  problems  are  treated  here,  say  in  the 
general  form 

(1.  1)  G ( u , > ) = 0 

where  G : IB  X IR  — IB  for  some  Banach  space,  IB.  We  find  it  most 
instructive  to  refer  occasionally  to  the  cannonical  example  of  the 
matrix  eigenvalue  problem: 

(1,2)  Au  - > u = 0 , 


where  A is  an  n X n real  matrix  with  real  eigenvalues. 
By  a smooth  branch  or  arc  of  solutions 


(1.  3) 


Tb  : 1 u(s ),  >(s)]  . sa  S s «sb 


we  mean  a one  parameter  family  of  solutions  of  (1.1),  u(s)  e IB, 

>(s)  e IR  depending  twice  continuously  differentiably  on  some  parame- 
ter s e[  s^,  sbJ  • Of  course  the  parameter,  s,  is  quite  arbitrary  on 
each  such  branch  and  this  fact  is  crucial  in  our  study.  In  figures  la 
and  lb  we  sketch,  respectively,  typical  solution  branches  for  problems 
(1.1)  and  (1.2).  The  "points"  at  ) . and  > represent  "simple"  bifur- 
cation  while  > ^ is  a multiple  bifurcation  point.  The  "arcs"  F 
in  figure  lb  for  j # 0 represent  the  eigenspaces  of  A belonging  to 
the  eigenvalues  >.  while  To  is  the  trivial  solution.  For  simple 
eigenvalues  the  bifurcation  is  simple  (i.e.  a one  parameter  family  of 
solutions  branches  off).  At  > = on  branch  T|  in  figure  la  we 
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Figure  lb. 
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have  what  is  called  a "limit  point"  in  the  applied  mechanics  literature. 
These  present  no  difficulties  analytically  or  computationally  in  our 
current  theory  although  they  seem  to  have  been  quite  troublesome  in 

the  past  [ 1,  12,  20]  . Note  that  all  the  branches  Tj , T2 in  the 

eigenvalue  problem  are  composed  entirely  of  these  limit  points. 

We  assume  that  the  basic  problem  is  to  compute  large  segments 
of  solution  branches  of  (1.  1)  including  all  branches  bifurcating  from 
each  segment. 

2.  Pa rametrization  and  Continuation  of  Solution  Arcs 

— — ■■  ~ — — ■ - 

The  standard  approach  is  almost  invariably  to  use  X,  one  of  the 
naturally  occuring  parameters  of  the  problem,  as  (he. parameter  de- 
fining solution  arcs,  u(>  ).  Indeed  if  for  some  > = >0  a solution 
u = u0  of  (1.  1)  is  isolated,  that  is 

(2.1)  G°  s Gu(u0,>0) 

is  nonsingular,  and  if  G(u,  >)  is  C,  in  some  p0  - sphere  about 
l u0,>0]  thffh  the  implicit  function  theorem  insures  the  existence  of  a 
unique  smooth  arc  of  solutions  u = u(>  ) for  |>  - >0  I < Pi>  say. 
Furthermore,  with  our  assumed  smoothness,  it  follows  that 
du(>  )/d>  exists  and  satisfies- 

(2.  2)  Gu(u(>),  = - G>(u(>),  >) 

Many  procedures  are  now  available  for  extending  or  approximating 
the  solution  branch  through  [ u0  , > 0 1 • The  implicit  function  theorem 
suggests  contraction  mapping  techniques  while  (2.  2)  suggests  predic- 
tor-corrector continuation.  A nice  survey  of  the  latter  ideas  (for 
finite  dimensional  problems)  is  given  by  Rheinboldt  f 18].  In  particu- 
lar we  have  frequently  used  [ 3]  one  step  of  Euler's  method  in  (2.  2)  as 
a predictor: 

(2.  3)  a)  u°(>  + 6>)  = u(>)  + 6> 

to  supply  the  initial  iterate  for  Newton's  method  to  solve  (1.  1)  at 

> + 6> : 

(2.  3)  b)  G^  6uv(X  + 6>  ) = - GV  , v-  = 0,  1 ; 


whe  re : 
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(2.  3)  c) 

It  is  not  difficult  to  base  existence  proofs  on  such  techniques  provided 
the  arc  consists  of  isolated  solutions. 

All  of  the  indicated  continuation  procedures  may  fail  or  encoun- 
ter difficulties  as  a nonisolated  solution  is  approached;  that  is  a 
point  [u0>>0]  where  G°  is  singular.  We  also  call  these  singular 
points.  As  we  shall  see  in  §3  the  above  indicated  continuation  pro- 
cedures could  skip  over  some  singular  points  but  not  over  limit 

points  as  at  > = > in  figure  la.  Also  at  bifurcation  points  some 

special  procedures  are  required  to  switch  from  one  branch  to  an- 
other. A simple  analysis  shows  that  the  scheme  (2.  3)  is  incapable  of 
tracing  out  any  nontrivial  solutions  of  the  eigenvalue  problem  (1.  2)! 

To  circumvent  these  difficulties  we  recall  that  the  parametriza- 
tion  of  solution  arcs  is  at  our  disposal.  Thus  we  are  free  to  impose 
some  additional  constraint  or  normalization  on  the  solution  and  we 
do  this,  quite  generally,  by  replacing  (1.  1)  by: 

(2.1)  a)  G(u,  > ) = 0 , b)  N(u,  >,s)=0  . 

Here  N : IB  X IR2  IR  and  s e IR  is  the  independent  parameter  on 

the  solution  arc.  We  shall  show  several  choices  for  N which  make 
s an  approximation  to  "arclength"  on  the  solution  branch.  Then,  as 
we  shall  see,  limit  points  essentially  disappear,  it  is  easy  to  jump 
over  singular  points  or  to  compute  them,  relatively  large  steps  in  s 
can  be  taken  and  it  is  easy  to  switch  branches  at  bifurcation  points. 
Note  that  we  are  not  simply  changing  the  parameter  in  the  problem  as 
was  first  done  in  [ 12]  or  in  another  way  in  [ l]  . 

By  introducing  x e if  s IB  X IR  and  P : X X W -*  X as 

/G(u,  >)  \ 

(2.  5)  a)  x = [ u,  >]  , P(x,  s)  s | I 

\N(u,  >,  s )/ 


fGu  5 Gu(uV(X  + 6>),  X +6X  ) , Cv  = G(uv(X  + 6X),  >+6>)  , 

V 


uv+1(>+6>)  = uv(X  +6X  ) + 6uv'(>+6>)  . 


a solution  arc  of  (1.  I)  or  (2.  4)  is  x(s)  £ f u(s),  >(s)]  and  it  satisfies 
(2.  3)  b)  P(x(s),  s)  = 0 . 
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For  fixed  s a solution  x(s)  is  isolated  if 


(2.6) 


A(s)  = Px(x(s),  s)  = 


'Gu(u(s),  >(s)) 


(u(s ),  >(s)) 


Nu(u(s),  >(s),  s)  N^ujs),  X (s),  s)y 


is  nonsingular.  Furthermore  on  a smooth  arc  x(s)  = dx(s)/ds 
satisfies 


(2.  7) 


/J(s)  k(s)  = - 


O 


kNg(u(s),  >(s),  s)/ 


Now  continuation  in  s could  proceed  in  exact  analogy  with  our  prior 
continuation  in  X.  However  the  possible  advantages  of  our  reformu- 
lations are  clarified  by  the  fact  that  Px  can  be  nonsingular  while 
G is  singular.  Indeed  this  is  but  a special  case  of  the  basic: 

Lemma  2.  8.  Let  IB  be  a Banach  space  and  consider  the  linear 
ope rator  A • IB  X IRV  — * IB  X IR  of  the  form: 


A : IB  — IB  , B : IR  — IB  ; 


where 


♦ U 

C : IB  — IR 


D : IR 


IR 


* -1 

D - C A B is  nonsingular  . 


i)  If  A is  nonsingular  then  A is  nonsingular  iff: 

(2.8)  a) 

ii)  If  A is  singular  and 

(2.  8)  b)  dim/V  (A)  = codim/?  (A)  = v 

then  A is  nonsingular  iff: 

c0)  dim  R (B)  = \>  c,)  R (B)O  R (A ) = 0 , 

(2.8)  * * 

c*  ) dim/?  (C  ) = v . c3)  N (A )0/V  (C  ) = 0 . 

iii)  If  A is  singular  and  dimA^  (A)  > v then  A is  singular. 


Proof.  Not  difficult  to  work  out;  it  will  appear  elsewhere,  [10] 
Using  Lemma  2.  8 and  our  reformulation  of  (1.  1)  it  is  easy  to 
prove  a variety  of  bifurcation  theorems  which  we  do  not  report  here. 
We  only  use  in  the  present  work  the  special  case  of  Lemma  2.8  with 
v = 1.  Then  given  (2.8b)  the  conditions  in  (2.8c)  simply  reduce  to 
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.(2.9)  B//t(A)  and  C*  f R(A*)  . 

As  a simple  example  of  the  use  of  our  procedure  consider  the  eigen- 
value problem  (1.  2)  subject  to  either  of  the  normalizations 

(2.  10)  a)  N0(u,X,s)=X-s  ; b)  Nj  (u,  X,  s)  = Hull2  - s2 


Using  N0  0 the  predictor-corrector  continuation  scheme  generates 
the  trivial  solution  branch,  u = 0,  > = arbitrary,  starting  from  any 
point  on  this  branch.  Alternatively  using  N,  = 0 our  scheme  traces 
out  the  one  dimensional  eigenspace  belonging  to  any  simple  eigen- 
value X = X0.  Switching  over  from  one  of  these  normalizations  to  the 
other  at  a simple  bifurcation  point  (u,  X)  = (0,  Xo)  allows  us  to  trace 
out  the  two  branches  through  this  point.  Indeed  this  is  the  key  to  our 
method  for  switching  branches  at  bifurcation  points  in  the  general  case, 
see  §5. 

A somewhat  natural  parametrization  of  a solution  branch 
[ u(s),  X ( s ) ] is  to  use  for  s a form  of  arclength.  That  is,  for  some 
6 e(0,  1): 


(2.  10)  c)  Ne(u,X,  s)  = 0llu(s)ll2  + (1  - 6)  |X(s)|2  -1=0. 

This  form  is  not  the  most  practical  one  to  use,  even  for  merely  prov- 
ing existence  theorems  and  so  we  use  approximations  to  it.  Assum- 
ing a solution  of  (1. 1)  known  say  [ u,  > ] = [ u0  , X0  ] we  set  [ u0  , X0  ] = 

[ u(s0  ),  X(s0  )]  and  define  over  s0  ^ s < s1 : 


(2.10)  d) 


N2  (u,  X,  s)  = Ollu(s)  - u(so)H2  + (1-0)  | X ( s ) - > ( s0  ) | 2 
- (s  - s0  )2  = 0 • 


Alternatively  if  in  addition  to  [ u0  , X0  ] we  know  [ u0  , Xo  ] satisfying 
(2.10c)  at  s = s0  then  we  can  use  on  s0  < s < s , : 

10)  e)  Nj  (u,  X,  s)  = 0 u*(s0  )[  u(s)  - u(s„  )] 

+ (1-0)  X(s0  )[  X(s)  - X (s0  )]  - (s  - s0)  = 0 . 

. * * 

Here  u (s0  ) c IB  is  the  dual  element  to  u(s0  ) such  that 
u (s0  ) d(s0)  = lld(s0)ll2.  (The  existence  of  such  an  element  is  assured 
by  the  Hahn-Banach  theorem.  ) We  call  N2  or  N3  pseudo-arclength 
normalizations  and  examine  some  of  their  properties  in  §3.  Pre- 
vious attempts  to  use  arclength  as  a parameter  in  solving  nonlinear 
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algebraic  systems  have  been  made  in  [ 13,  14] . 

3.  Continuation  About  Regular  and  Limit  Points. 


We  shall  justify  continuation  procedures  using  the  normalization 
Nj  on  solution  arcs  composed  of  "regular  points"  or  "normal  limit 
points".  Specifically  let  [u0,A0]  be  a solution  of  (1.1)  and  let 
[ ia0  , X o ] satisfy 

(3.  0)  a)  G®  u0  + G®  A0  = 0 , b)  llu0  II2  + |>0  |2  > 0 . 

Then  we  say  that  | u0  , >0  ] is  a regular  solution  (point)  if  in  addition: 

(3.1)  G®£Gu(u0,A0)  is  nonsingular  . 

We  call  [ u0  , >o  ) a normal  limit  solution  (point)  if  (3.  0)  holds  but  in 
place  of  (3.  1)  we  have: 

(3.2)  a)  dim/V(G°)  = codim/?  (G®)  = 1 , b)  G®//?(G®)  . 

Theorem  3.  3.  Let  [u0,A0]  be  either  a regular  solution  or  a normal 
limit  solution.  Let  G(u,  A)  have  two  continuous  derivatives  in  some 
sphere  about  [ u0  , Ao  ] . Then  with  f u(s0  ),  >(s0  )]  = [ u0  , A0  ] , 

[ u(s0  ),  A(s0  ] s f u0  , >0  ] and  u (s0  ) as  defined  after  (2.  lOe)  there 
exists  a unique  smooth  arc  of  solutions  [ u(s),  >(s)]  of  (2.4)  using 
N = N3  on  Js  - Sq  | < p for  some  sufficiently  small  p > 0.  On  this 
solution  arc  the  Frechot  derivative  A (s)  of  (2,6)  is  nonsingular. 


Proof.  All  of  these  results  will  follow  from  the  implicit  function 
theorem  applied  to  (2.  4 ) at  [ u,  A , s ] = [ u0  , Ao , s0  ] if//(s0)  is  non- 

singular.  We  first  consider  [u0,Ao]  to  be  a normal  limit  solution. 

By  (3.  2b)  in  (3.  0a)  it  then  follows  that  A0  = 0 and  hence  u0  e/V(G®  ). 

* •*//)(!*  U 
Now  (3.  2a)  implies  since  u0  u0  # 0 that  u0  ).  This  result 

and  (3.  2)  now  yield  with  v - 1 in  Part  ii  ) of  Lemma  2.  8 that 


A(u o ) = 


is  nonsingular  [ use  the  form  in  (2.  9)]  . 

Next  let  [u0,A0]  be  a regular  solution.  If  X0  =0  then  by  (3.0a) 


/G® 

G® 

[ u 

> 

1 

\ . * 

\0uo 

(1-0)A 
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and  (3.1),  u0  =0.  This  contradicts  (3.0b)  so  X0  # 0 at  a regular 
point.  Then  (3.0a)  and  (3.1)  imply 

h0 /So  = - (G®)-1  G®  . 

Now  by  Part  i)  of  Lemma  2.  8.  /!{ s0  ) above  is  nonsingular  iff 

( 1 - e )>■  o - 0 ho  *(G“f1  G«*0  . 

That  is,  using  the  above,  iff 

[ e u0*u0  + (i-e)X02]/Xo  = [ enh0ii2  + (i-e)  |>‘0  |2]/>'o  * o . 

Since  >'0  1 0 and  llu0ll  # 0 it  follows  that  //(s0)  is  nonsingular.  ^ 
Clearly  any  smooth  branch  of  solutions  composed  of  regular 
points  or  normal  limit  points  can  be  determined  using,  say,  Eule  r- 
Newton  continuation  on  (2.  4)  with  the  normalization  Ns  N3  . We 
could  easily  justify  the  normalization  N = N2  for  smooth  arcs  since 
on  them: 

u(s)  - u(s0  ) = u(s0  )(s  - s0  ) + 0(  | s - s0  |2  ) , 

>(s)  - X(s0  ) = >'(s0  )(s  - s0  ) + (X  |s  - s0  |2  ) . 

When  using  (2.  10,e)  over  a sequence  of  intervals  [ s„  , s , ] , [s,,s2],... 
it  is  a good  policy  to  impose  the  arclength  condition 

0llu(s)ll2  + (1  - 0)  J > ( s ) |2  = 1 


periodically,  say  at  each  joint,  s = s^.  The  resulting  arc  is  then 
only  piecewise  smooth  in  s,  with  jump  discontinuities  in  the  length 
of  the  tangent  vector  [u(s),  X(s)]  at  s = s^.  Specifically  if 
[ u(Sj^),  >(s^)]  is  the  limit  as  s { s^  then  we  use  on  [ s^,  sk+^] 


[u(sk),  >(sk)]  =clu(sk),  >(sk)]  , 

c"2  = 0llu(sk)ll2  + (1-0)  |X(sk)|2  . 


This  renormalization  allows  more  uniform  steps  in  s to  be  taken 
during  the  continuation  process. 


4.  Continuation  Past  Singular  Points. 

A solution  x(s)  = [ u(s),  >(s)]  of  (2.  4)  is  said  to  be  singular  or  a 
singular  point  if  /?( s)  in  (2.  6)  is  singular.  We  will  consider  smooth 
arcs  of  solutions  x(s)  for  sa  « s S sj,  on  which  only  x(s0  ) for  some 
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s0  <- ( s ^ , s^)  is  singular.  Under  mild  smoothness  conditions  various 
continuation  procedures  can  "jump"  over  the  singular  point  in  going 
from  s^  to  s^.  A simple  way  to  do  this  is  by  using  Euler-Chord  or 
Euler-Newton  continuation,  as  we  proceed  to  show. 

We  assume,  as  stated  above,  that 

(4.  0)  /)  (s)  = P (x(s),  s)  is  nonsingular  for  a e[  s , s,  ] - { s0}  . 

Then  the  tangent,  xfs^),  i-s  uniquely  defined  by 

(4.1)  a)  /}(s  ) x(s  ) = - P (x(s  ),  s ) , 

d d odd 

and  an  approximation  to  x(s)  is 


(4.1)  b) 


x°(s)=x(s  ) + [ s - s ] x(s  ) 
a a a 


Using  this  approximation  we  consider  the  chord  (or  special  Newton) 
method : 


a)  //°(s)  — P (x°(s),  s)  , 


(4.  2) 


b) 


/l°  ( s)  ^xv,1(s)  - xV(s)J  = - P(xV 


(s),  s)  , v = 0,  1, 


Ve  could  also  try  Newton's  method,  with  y°(s)  = x°(s): 


(4.  3) 


0 /T(s)  = P (yV(s),  s) 


b)  /T(s)  ^yV+1(s)  - yV(s)J  = - P(yv(s),  s) 


v = 0,  1 


To  get  convergence  we  need  only  show  that  x°(s)  is  in  the  appropriate 
domain  of  attraction  about  x(s)  and  that  the  /jv (s)  are  all  nonsingu- 
lar. We  do  this  in 


Theorem  4.  4.  Let  x(s)  be  a twice  continuously  differentiable  arc  of 
solutions  of  (2.  5)  on  [ s^,  s^]  . Let  (4.  0)  hold  and  for  some  positive 
functions  K(s),  ( s ) and  p ( s ) defined  on  [ s^J  : 

a)  !IPx(y,  s)  - Px(x(s),  s)ll  «K(s)  lly  - x(s)ll  V y in 

(4.  4)  lly  - x(s)ll  p(s)  ; 

b)  max  IIJf(t)ll  < X.  (s)  . 

s «t«s 

a 

F or  s e[  s^,  Sj  J ■ { so}  define 
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(4.4)  c)  M(s)  = II /I  L(s )ll  , 

and  for  some  positive  0(s)<l/3  define 

(4.4)  d)  r(»)S  min[p(s),  ] . 

Then  if 

(4.4)  e)  js  - s a |2  (s)  2r(s)  , s * s0  , 

the  Chord  iterates  xv(s)-»  x(s)  with  geometric  converRence  factor 

29(s) 

1 - G(s)  • 


Proof.  It  easily  follows  from  (4.  1)  and  (4.  4)  that 

llx°  (s)  - x(s)ll  ^ ^ |s  - sa  |2  K-  (s)  . 

Then  (4.  4d,  e)  imply  that  llx°  (s)  - x(s)ll  < r(s).  Now  (4.  4a,  c ) y^'ld 
H//"1(s)t/°(s)  -/\(s)]  II  «M(s)  K(s)  r(s ) < 6(s)  . 

Thus  the  Banach  Lemma  insures  that  /l°  ( s)  is  nonsingular  with 

We  can  now  define 

H(y,  s)  = y -/7°  (s)  1 P(y,  s) 
and  for  all  y,  z in  II y - x(s)ll  ^ r(s)  : 

HH(y,  s)  - H(z,  s ) II  = ll/°  (s)'1^0  (s)(y-z)  - (P(y,  s)  - P(z,  s)]  II  , 

K<s>  2r<s>  "y-zl1  ■ 

< — ) Hy  _ z|| 

1 - 0(s)  y • 

Since  0(s)  < 1/ 3 we  get  that  H(y,s)  is  contracting  on  lly-x(s)ll  « r(s) 
and  the  theorem  follows.  ^ 

Note  that  M(s)  must  become  unbounded  and  thus  r(s)  — 0 as 
s -*  s0  . Our  result  thus  uses  the  fact  (see  figure  2)  that  there  is  a 
cone  about  x(s)  with  vertex  at  x(s0)  and  interior  to  this  cone  the 
chord  method  converges.  To  jump  over  a singular  point  the  tangent 
to  x(s)  at  x(«a)  need  only  penetrate  the  cone  for  some  s > s0  for 
which  (4.  4e)  holds.  Clearly  if  the  curvature  of  the  solution  arc  is  not 
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too  great  over  [ s , s^]  this  can  be  achieved. 

Newton's  method  can  also  be  shown  to  converge  in  the  same  cone. 
However  Newton's  method  may  even  converge  in  a much  larger  re- 
gion, including  a cylindrical  tube  about  x(s).  Thus  the  singular  solu- 
tion x(s0  ) can  be  determined  directly  in  such  cases  not  only  by  bi- 
section. Unfortunately  the  status  of  the  convergence  of  Newton's 
method  at  singular  points  is  not  completely  clear  at  the  present  time. 
The  main  idea  and  no  doubt  the  behavior  to  be  expected  in  many  cases 
is  explained  in  the  basic  paper  of  Rail  [16],  But  the  details  of  a 
proof  with  reasonable  sufficient  conditions  seem  to  be  lacking.  Pro- 
gress in  this  direction  has  recently  been  made  by  Reddien  f 1 7 J . 

5.  Switching  Branches  at  Bifurcation  Points. 

Bifurcation  points  are  solutions  at  which  two  or  more  smooth 
branches  of  solutions  of  (1.1)  have  non-tangential  intersections.  In 
particular  they  are  singular  points,  say  [ u0  , X0  ] , at  which: 

a)  dim/ty  (G°  ) = codim  R (G°  )•  = m , 

(5.0)  u u 

b)  G®  e/?(G°u)  . 

From  (5.  0a)  we  have  the  existence  of  elements  <b.  e IB  and  iL.  e EB 
, , J J 

such  that: 

A(G®)  = span  {<(>,,  & , . ...  <(>  l') 

U 1 Jjc 

(5.  1) 

^ (G®  ) = span  {4/1 , ih 4/mj 

In  addition  (5.  Ob)  implies  the  existence  of  a unique  element  (j>0  e IB 
such  that: 

(5-  2)  G°n  4>o+G®=0  ; 4>o  = 0 , 1 «j  <m  . 

Let  [ u(s),  X(s)]  be  any  smooth  branch  of  (1.  1)  through  the  bifur- 
cation point,  say  with  u(s0)  = u0 , X(s0  ) = X0  . Then  since 

(5.  3)  a)  G®  u(s0)  + G®  X(s0  ) = 0 

it  follows  from  (5.1)-(5.  2)  that 

(5.  3)  b)  u(s0 ) = ) a.  4>. 

j=o  J J 


) 4>-  = 6.s  I i.  j = 1,  2,  . . . m. 

I i J ij  J 


BIFURCATION  AND  NONLINEAR  EIGENVALUE  PROBLEMS 


371 


where 

♦ 

(5.  3)  c)  o o = X(s0  ) ; = 4^j  u(s0  ) , 1 < j <m  . 

We  get  by  differentiation  of  G(u(s),  X(s))  =0  at  s = s0  : 

(5.  4)  G°  do  = - [ G®  ti(s0  )u(s0  ) + 2G°  u(s0  ) X’(s0  ) 

U UU  UA 

+ G°x  >(s0)  X(s0)]  - G®  X(s0  ) • 

Since  G®  ii(s0  ) e ft(G°  ) and  G?  e fi(G°  ) the  bracketed  term  on  the 

VJl  U A ^ 

right  side  of  (5.  4)  is  also  in  /^(G^).  Then  4^  [ . . . ] =0  must  hold  for 
this  term  and  so  using  (5.  3)  it  follows  that  the  m + 1 scalars 
{oq a-  } mast  satisfy  the  quadratic  system: 


(5‘5)  a)  jiLiijkTk"2  i 


) b. . a.  «0  + c.  = 0 , l^i 

M 1J  J 1 


i < m ; 


whe  re 


(5.5)  b)  a.-.s^G®  4>.<h  , b..=  4>.[G®  <}>0  + G®  J 4>.  , 

x ' ' ljk  1 UU  J 4C  1J  NL  UU  UA  J 

1 ^j,  k ; 


ci=  (Guu  <t>®‘*)®  + 2G°U>  ^ + G>>)  • 

Thus  the  tangent,  [ u(s0  ),  X(s0)],  to  every  smooth  branch  through 
the  bifurcation  point  [ u0  , Xo  ] must  have  the  form  (5.  3b,  c)  and  satis- 
fy (5.  5).  Conversely  if  at  a solution  [ u0  , Xo  ] of  (1.  1)  conditions  (5.  0) 
hold  and  (5.5)  has  r ^2  distinct  nontrivial  roots  ^ then  [u0,Xo]  is  a 
bifurcation  point  with  at  least  r smooth  solution  branches  of  (1.  1)  in- 
tersecting there.  Essentially  this  result  is  proven  in  [ 11]  . In  the 
important  special  case  m = 1,  the  algebraic  bifurcation  equations 
(5.  5)  reduce  to  the  single  quadratic 

(5.  6)  a)  4*  2b  j ] o' j Oq  + Cj  &o  = 0 . 

If  [ar0,a]]  is  one  nontrivial  root  of  this  quadratic  then  the  other  root 
is  distinct  provided 


(5.6)  b)  a nl  aj  + bn  «0  * 0 . 

If  [ uj  (s),  X,  (s)]  is  a smooth  branch  of  solutions  through  the 

f Since  (5.  5a)  is  homogeneous  a root  is  determined  only  to  within  a 
multiplicative  constant.  Roots  are  said  to  be  distinct  if  they  are  not 
scalar  multiples  of  each  other. 
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I 


i 

* 

L 


bifurcation  point  with  tangent  [ u,(s0  ),  X'i(s0  )]  determined  by 
[ <*o  . ] in  (5.  3b,  c ) with  m = 1 then  the  condition  (5.  6b)  can  be  writ- 

ten as: 

(5.6)  c)  Ms0)  + G°u>  Ms®  )]«>,*  0 • 

This  is  essentially  the  form  of  the  bifurcation  condition  given  by 
Crandall  and  Rabinowitz  [ 5],  However  it  is  seldom  pointed  out  that 
this  condition  insures  the  existence  of  two  distinct  roots  of  a quadra- 
tic^; Rheinboldt  [ 18]  makes  this  quite  explicit  for  the  class  of  pro- 
blems he  treats. 


Method  I.  An  obvious  way  to  determine  branches  bifurcating  at 
[u0,X0]  is  to  determine  several  distinct  roots  of  (5.  5),  use  them  in 
(5.  3b,  c)  to  construct  several  distinct  tangent  vectors  [ d^(s0  ),  X'^(s0  )] , 
k = 1,  2,  . . . , then  use  each  of  these  tangents  in  N3  =0  of  (2.  lOe) 
and  proceed  as  previously  indicated.  For  m = 1,  simple  bifurcation, 
similar  devices  have  been  used  and  suggested  in  analytical  and  per- 
turbation studies  [ 7,  18,  21]  . Rheinboldt  [ 18]  uses  an  approximation 
to  the  second  root  in  some  of  his  numerical  methods  to  predict  a point 
on  the  "second  "branch.  This  idea  can  be  used  quite  generally  to 

obtain  approximations  to  the  coefficients  {a...  , b..,  c.}  in  (5.  5)  if 
$ i J i 

the  4k  and  ^ are  known  (or  sufficiently  well  approximated).  Thus 


we  define: 


a) 

Ve)* 

b) 

b. . ( e ) = 

ij 

(5.  7) 

c) 

c.(e)  s 

,*  1 


+ [ G^(u0  + e<|>j,  Ao  ) - G^]} 


+ 2[G^(u0  + cfo , >0  ) - G°] 


[G  (u0,  >0  + e)  - G°]} 


f In  bifurcation  from  the  trivial  solution  [ u,  (s),  X j (s  )]  = [ 0,  s]  it  fol- 
lows that  4>o  = 0,  since  G^  = 0.  Hence  cj  = 0 and  the  quadratic  (5.6a), 
in  the  case  m=l  of  simple  bifurcation,  has  the  two  solutions:  «,/«„  = 0, 
“i/oo  =“2b11/aUI.  Clearly  b,  j * 0 if  (5.  6c)  holds  since  (ij  (s0  ) = 0. 


L 
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Clearly  {a.^U),  b.j(e),  c.(e)}  — { a b^,  c.}  as  e — 0.  The  a- 
bove  scheme  avoids  the  need  for  determining  second  Frdchet  deriva- 
tives. 


Method  II.  There  are  other  devices  which  avoid  the  need  to  evaluate 
the  coefficients  or  roots  of  (5.  5).  This  assumes  that  one  branch 
through  the  bifurcation  point  has  been  determined.  Then  the  tangent 
[u(s0),  <(s0)]  can  also  be  assumed  known  on  this  branch.  The  idea 
is  simply  to  seek  solutions  on  some  subset  "parallel"  to  the  tangent 
but  displaced  from  the  bifurcation  point  in  some  direction  "normal"  to 
the  tangent. 

For  example  in  the  case  m = 1 of  simple  bifurcation  the  known 
solution  branch  [u1(s)>  X j (s )]  has  the  tangent  at  the  bifurcation, 
s = s0  given  by  (5.  3b,  c).  An  "orthogonal"  to  this  tangent  in  the  plane 
spanned  by  [ <j>j,  0]  and  [ cfo  , 1]  is  also  given  by  (5.  3b,  c)  but  with 
O'!  and  a 0 replaced  by: 

(5.  8)  a)  a,  = a0  (1  + llcfo  II2  ) , a0  = - a , II  <p,  llz  . 


We  use  Newton's  method  to  solve  (5.  9)  for  v e IB  and  p e 3R  with  the 
initial  estimate  (v°  , rj°  ) = (0,  0).  Here  e must  be  taken  sufficiently 
large  so  that  the  scheme  does  not  return  to  (u  j (s0  ),  > , (s0  ))  as  the 
solution. 


Method  III.  Another  way  to  determine  a branch  bifurcating  from  a 
known  branch  [ u(s),  >(s)]  at  s = s0  is  to  apply  a constructive  exis- 
tence theory,  using  iterations,  say  as  in  [ 8,  11]  . To  sketch  the  basic 
idea  we  consider  simple  bifurcation  and  seek  the  bifurcated  branch 
in  the  form 
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(5.10)  a) 


a = u^cr)  + e[  <t>j  + v]  , 4;j*v  = 0 ; 

> - > i (cr)  . 


Then  (1.  1)  is  written,  using  (5.  10a),  as 

(5.10)  b)  G°v  =G°uv  -^G(u,(ff)  + e[  <♦>, +vj , > , (o^))  ; i|,*v  = 0 . 

To  insure  that  the  right  hand  side  is  in  /?(G°  ) we  try  to  pick  <j  - s 
such  that  h(s;  e,  v)  = 0 where 

+ i [ G°  v - 7 G(u,  (s)  + e[  4>j  +v] , X]  ( s ) )].  e # 0 , 


(5. 10)  c)  h(s;e,  v)  h 


4*1  [ v " G (uj  (s),  X,  (s))  [ <j>j  +v]  , e = 0 


It  easily  follows  that  h(s0;0,  0)  = 0 and 

(5.  10)  d)  h°  = hs(s0  , 0,  0)  = - ij,  *[  G®  u u,  (s0  ) + G®  > V,  (s0  )]  . 

Thus  as  in  (5.  6),  hs(s0  ;0,  0)^0  and  so  the  implicit  function  theorem 
yields  a root,  s = cr(e,  v),  of  h(s;e,  v)  = 0.  We  use  this  root  in  (5.  10b) 
and  then,  by  contraction  maps,  it  is  easily  shown  that  (5.  10b)  has  a 
unique  solution  v = v(e)  for  |e|  sufficiently  small  [ 10] . 

The  main  difficulty  in  applying  the  above  procedure  is  in  solving 
h(s;e,  v)  = 0 for  s at  each  iterate  v = vV,  say.  Of  course  if  X oc- 
curs linearly  in  the  problem  and  it  is  used  as  the  parameter,  s,  then 
this  is  trivial.  But  when  > occurs  nonlinearly  as  it  must  for  secon- 
dary bifurcation,  modifications  must  be  introduced.  Several  have 
been  proposed  in  [4,6,15,19].  For  example  given  the  iterate, 

(ffV.  vV),  we  could  use  the  chord  method  to  define  crV+1  as  in 


(5.11)  a) 
and  then  v 
(5.11)  b) 


0 V +1  o V I/V  £ t 

m a = m°  a - h(a  ; v ) 


m°  = B 


v+1 


is  obtained  from 

7G(ui(aV)  + e[<j>1+vV],  > j ( crV ) ) 


G°  vv+1  = G°  v 
u u 


- b[ 


v+1 


Vi  . * V+1  - 

a ];  ip,  v = 0 


Applying  4«,  to  the  right  hand  side  above  we  see  that  it  is  in  R( G°  ). 
Furthermore  with  the  choice 

(5.12)  a)  B =B°  S fGuu  “i(s0)  + G“x  X’,(s0)] 

it  follows  from  (5.  lOd)  that  m°  = h'  ^ 0.  There  is  no  difficulty  in 
showing  convergence  of  the  above  scheme.  To  avoid  the  evaluation  of 
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second  derivatives  we  can  use  the  trick  in  (5.  7)  and  take: 

(5.12)  b)  B = B(e)  s Gu(uj(s0  ) + e<t>,,  >,(s„))  uj(s0  ) 

+ G.(u,(s0  )+€<(>!,  Xi(s„  ))  Vjfs,,  ) . 

Clearly  B(e)  = B°  + 0(e)  so  that  B(e)  = h°  + 0(e)  and  the  proofs 
proceed  as  before.  This  modification  is  due  to  Rheinboldt  [ 19 ] where 
it  is  justified  for  finite  dimensional  problems.  He  also  shows  that  if 
the  solution  branch  [ Uj  (s),>  j (s)]  in  (5.11)  is  replaced  by  the  "parabo- 
lic" approximation:  ^ 


Uj(s)  = Ujfso  ) + (s-s„  ) uj  (s0  ) + i(s-s0  )2  Cf  j ( s0  ) 


(5.13) 


>i  (s)  = >i  (s0  ) + (s-s0  ) Xj  (s0  ) + j(s-s0  )2  >i  (s0  ) 
the  procedure  still  converges  to  a bifurcated  solution. 

Method  IV.  A final  method  for  determining  the  bifurcating  branch  at 
a simple  eigenvalue  is  to  use  a technique  based  on  a modification  [ 10] 
of  the  Crandall  and  Rabinowitz  [ 5]  proof  of  bifurcation.  Thus  we 
again  seek  solutions  of  the  form  (5.10a)  and  define 

J 7 G(ui(s)  + e[  <f>i  + v] , > i (s ))  , e * 0 ; 

(5.  14)  a)  g(v,  s;e)  =< 

[gu(ui(s),  X,  (s))[  <t>,  + v]  , e=0  ; 

♦ 

b)  N(v,  s;e)  = i|q  v . 

Now  we  note  that 

(5.15)  a)  g(0,  s0;0)=0  , N(0,  s0;0)=0 

and  the  Frechdt  derivative  at  (v,  s;e)  = (0,  s0;0)  is 

(5.15)  b)  A0  | = (G{i  B] 


0 

, N(0 

, s0 ;0) 

= 0 

(v. 

s;e)  = 

(0,  s0  ; 

0)  is 

1 

/g° 

B°\ 

- 

1 

(0. 

so  ;0) 

* 

\+i 

°J 

where  B°  is  given  in  (5.  12a).  If  (5.  6c)  holds  then  by  our  Lemma  2.  8 
it  follows  that  A0  is  nonsingular.  Now  the  usual  implicit  function 
theorem  shows  that 

fg(v,  s;e)  = 0 , 

(5-15)  C)  \N(v.s;e)-0  . 

f In  l 19]  the  parameter  s = > is  employed  but  the  above  indicated 
generalization  can  be  shown  to  work  with  no  additional  difficulties. 


Jg(v,  s;e 
(J4(v,  s;< 
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has  a smooth  solution  (v(e),  a(e))  for  each  ) e J ^ e0  and  using  this 
solution  in  (5.10a)  yields  the  bifurcating  branch  of  solutions. 

In  solving  (5.15c)  we  never  use  e = 0 so  that  even  when  employ- 
ing Newton's  method  second  derivatives  need  not  be  computed.  Fur- 
ther we  can  use  the  device  of  Rheinboldt  and  replace  [ u,  (s  ),  > [ (s )] 
in  ( 5.  10a)  and  (5.  14a)  by  [ u j (s ),  > j (s )]  of  (5 . 13 ).  The  implicit  func  - 
tion  theorem  still  holds  as  above  and  now  we  get  a solution 
(v(e),  cr(e))  of  the  modified  (5.15c)  to  use  in  the  modified  form  of 
(5.10a).  The  rigorous  justification  of  this  procedure  is  straightfor- 
ward and  obviously  the  method  can  be  made  constructive,  [10]. 

6.  Numerical  Methods 

To  apply  the  previously  indicated  procedures  we  must  assume 
that  stable,  convergent  numerical  methods  are  known  for  approxima- 
ting the  solutions  of  the  linearized  problems  which  arise  in  Newton's 
method  or  in  the  chord  method.  Indeed  even  more  is  required  to 
rigorously  justify  the  numerical  methods  used  in  switching  branches  at 
a bifurcation  point.  We  must  be  assured  that  bases  for  A'(G^)  and 
A'(g°u*)  can  be  accurately  determined.  For  the  case  of  simple  bifur- 
cation this  is  not  very  difficult  since  the  theory  of  numerical  methods 
for  computing  eigenfunctions  belonging  to  simple  eigenvalues  for 
broad  classes  of  linear  operators  is  well  developed.  Indeed  the  only 
works  thus  far  to  justify  numerical  methods  at  bifurcation  points  con- 
sider simple  bifurcation  from  a tirvial  branch,  [ 2,  22,  23].  We  shall 
not  present  a general  convergence  theory  here  but  rather  indicate  the 
practical  aspects  in  actually  carrying  out  our  procedures. 

Basically  the  problem  (2.4)  is  discretized  in  some  form  which 
we  indicate  by: 

(6.1)  a)  Gh(uh,  >h)=0  , Nh(uh>  >h,  s)  = 0 . 

Here  (u^,  X^)  represents  the  approximation  to  (u,  >)  on  the  net  or 
family  of  nets  which  is  parametrized  by  h.  If  (2.  4)  has  a smooth  iso- 
lated solution  then  under  modest  assumptions  on  the  consistency  of 
[ G^,  N^]  with  [ G,  N]  and  on  the  stability  and  Lipschitz  continuity  of 
the  linearized  difference  operators,  say  the  general  theory  in 

[ 9]  assures  us  that  (6.  1)  has  a unique  solution  which  can  be  computed 


1 


BIFURCATION  AND  NONLINEAR  EIGENVALUE  PROBLEMS  377 


by  Newton's  method.  The  bulk  of  the  computations  occur  in  this  case 
and  we  write  the  difference  operators  linearized  about  (u^,  X^)  say 
in  the  form: 


(6.  2) 


From  the  comparison  with  A in  (2.  6)  we  see  that  is  a form  of 

difference  approximation  to  G^.  Indeed.  A^  is  in  general  a matrix 

of  (large)  order  h~m  when  the  basic  problem  is  formulated  in  £m. 

Similarly  b^  and  c^  are  column  and  row  vectors  approximating 

and  N , respectively,  while  d,  is  a scalar  approximating  N.  . 
u n a 

The  basic  computational  problem  is  to  solve  linear  systems  in 
the  form 


To  do  this  we  need  only  determine  y^  and  z^  satisfying 
(6.4)  a)  Ahyh  =bh  , b)  Ahzh  = rh  ; 


and  then; 

(6.4)  C)  6>h  = <chZh  + Ph)/(dh  - c*yh)  , d)  6uh  = zh  - 6>hyh  . 

Of  course  we  solve  (6.  4a,  b)  by  some  Gaussian  elimination  procedure 
with  some  form  of  pivoting  to  get  (neglecting  the  permutations  for 
clarity  of  presentation) 


(6.4)  e) 


= LhUh 


Since  the  bulk  of  the  computations  occur  in  determining  the  factoriza- 
tion (6.  4e)  it  follows  that  our  normalization  procedure  costs  very  little 
extra  effort.  Now  we  sketch  an  algorithm  for  generating  "all"  the 
solution  branches  of  (1.1)  using  the  indicated  techniques  and  assuming 
that  only  simple  bifurcation  occurs. 

ALGORITHM 


i)  Using  Euler-Newton  continuation  generate  an  approximate 
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solution  arc:  xlh(s)  s (ulh(s),  Xlh(s)),  skipping  over  any  "singular" 
points  that  are  encountered. 

ii)  Return  to  the  neighborhood  of  each  singular  point  and  locate 

it  accurately  (i.  e.  use  false  position  or  bisection  to  determine  the 
zero,  sqj1>  det  A^(s)).  In  particular  simple  roots  or  more  ge- 

nerally odd  order  roots  are  easily  sensed  by  the  sign  change  in 

det  A^.  However  one  must  remember  to  account  for  the  row  or 
column  interchanges  in  the  LU-decomposition. 

iii)  Test  for  limit  point  or  bifurcation  at  each  singularity.  To  do 

...  , . * * * 
this  we  need  an  approximation  ^ to  4^  , the  null  vector  of  . 

We  do  this  as  we  also  compute  <j)^  an  approximation  to  4>j  the  null 

vector  of  G^.  This  is  easily  and  efficiently  done  by  inverse  iteration. 

In  fact  if  we  are  really  close  to  a singular  point  then  it  suffices  to  use, 

say: 


a) 

(6.5) 

b) 


Ah\  = ' 

Ah\  = UhLX  = ^h  : 


where  <j)^  = 6u^  is  the  last  correction  in  the  Newton  scheme  used  to 
compute  u^(sQh).  As  we  already  have  the  LU-decomposition  in  (6.  4) 
these  calculations  are  not  costly. 

The  test  is  in  the  form: 


= 0 


seek  bifurcation 


(6.6) 


4bh 


4 0 , a limit  point, 

iv)  To  switch  over  to  a bifurcating  branch  we  must  also  compute 
[ u.,  (s),  X.,(s)],  an  approximation  to  the  tangent  to  the  solution 


branch,  x^fs),  at  the  point  snh  best  approximating  the  bifurcation 


Oh 


point.  However,  this  will  have  been  computed  in  step  i)  or  ii)  if,  as 
we  assume,  the  normalization  N3  of  (2.  lOe)  has  been  employed. 
Then  we  can  easily  determine  an  approximation  to  (fo  of  (5.  2) 

as  follows.  We  set 

(6.7)  a)  * * ' ‘ * 

and  then  compute 
(6.7)  c) 


a0h  ~ >lh(sOh) 


b)  alh  = 4ih  Glh(sOh)/ltJlh  +lh  ' 


"Oh 


Oh 


[(xlh{s0h]  ' alh  V 
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[ Of  course  if  a_,  = 0 we  cannot  use  this  simple  procedure.  In  its 
1 Oh 

place  we  must  use  a solution  of  (6.  4a)  at  s = sq^  and  subtract  a mul- 
tiple  of  <(>oh  so  that  ^lh  4>oh  = J 

Now  to  use  Method  I we  approximate  anl(6)  and  bn(6)  of  (5.  7b) 


by: 


a) 


b) 


(6.  8) 


lllh 


= ^lh  V Ah(ulh(sOh)  + 6<t>lh’  >lh(s0h)) 
" Ah(ulh(s0h)’ >lh(sOh))^lh 
bllh(6)  = +lh  6 1 Ah(ulh(b0h)  + b<hh'  Xlh(sOh>) 
' Ah(ulh(s0h)’  Xlh(s0h^  ^Oh 
+ ^lh  V bh(ulh(sOh)+64>lh’  Xlh(s0h)) 


' bh(ulh(s0h)’  X(s0h)^  • 


Then  we  approximate  the  other  root  of  (5.  6a)  by  t“0h'QlJ  where 


The  tangent  to  the  bifurcated  branch  is  approximated  by: 

<6-9)  a2h(s0h>  = "lh  *lh  + *0h  ^Oh  ’ X2h(s0h)  =Q,0h  • 

Using  (6.  9)  in  the  normalization  N3  we  simply  return  to  step  i)  and 
generate  the  bifurcating  branch. 

To  use  Method  II  we  proceed  as  before  but  do  not  bother  with 
(6.8).  Rather  we  approximate  [a0.ai]  of  (5.  8a)  by: 


_ = - a-.JId),.  II2 
Oh  lh  rlh 


(6.10)  a) 

Then  we  seek-a  solution  of  (6.1)  in  the  form: 


®ih  =aoh(1  + ll+oh"*) 


(6.10)  b) 


uh  = ulh(s0h)  + et*0h*0h  + “lhV  + Vh 

Xh  = Xlh(s0h)  + C “Oh  + \ : 


where  N.  (•  ) is  taken  as: 
h 


(6.10)  c) 


Nh(uh-  V = ("0h*0h  + “lh+lh)vh  + “ 


Oh  ^h' 


Once  a solution  f v,  , rj.  ] is  obtained  a new  tangent  vector  is  computed 
h h 

and  we  return  to  step  i)  using  the  normalization  N3  . Indeed  the  above 
indicated  computations  involve  but  minor  modifications  from  the 
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procedure  of  step  i). 

Method  III  has  been  discussed  in  more  detail  by  Rheinboldt  [ 19 ] . 

It  has  also  been  used  for  bifurcation  from  the  trivial  solution  in 
[ 15,  22,  23], 

To  our  knowledge  the  new  Method  IV  has  not  yet  been  used  in 
actual  calculations.  However  it  is  in  the  process  of  being  tested  at 
the  present  time. 

7.  A Simple  Example. 

We  have  used  the  procedures  of  §6  on  several  examples  of  the 
form: 

(7.1)  a)  uxx  + f(x,  u;> ) = 0 , u(0)  = u(l)  = 0 , 

where 

(7.  1)  b)  f(x,  u;X)  = 2 q(X)  + 7r2  X p(u-q(X  )x(l-x))  . 

If  p(0)  = 0 then  a solution  of  (7.  1)  is  given  by 

(7.  2)  u,(x,X)  = q(X ) x(l-x)  , X = arb  . 

The  linearized  problem  about  u^x)  is 

(7.3)  + *2>Pu(°)<t>  = 0 • <(>(0)  = <t>(D  = 0 . 

Thus  if  Pu(0)  = 1 the  eigenvalues  of  (7.  3)  are 
(7.4  ) Xk  = k2  , k = 1,  2 

We  have  used  several  choices  for  q(X  ) and  p(z)  but  we  show  here,  in 
figure  3,  the  computed  results  for  the  choice 

(7.  5)  q(X  ) = X2  e , p(z)  2 z + z2 

The  difference  scheme  used  was  the  Collatz  Mehrstellenve rfah ren 
which  is  C\ h4  ) accurate  and  the  net  spacing  was  h = 1/20.  The  pro- 
cedures of  §6  were  applied  using  the  pseudoarclength  normalizations 
N2  and  N3  and  Method  II  was  used  to  switch  branches  at  bifurcation 
points.  (The  norm,  II x II 2 was  allowed  to  go  "negative"  if  it  went  to 
zero  along  a branch;  thus  is  a smooth  curve  on  figure  3,  ) The 

only  initial  guess  required  was  [ u,  X]  = [ 0,  0]  employed  near  X = 0 
to  start  the  Euler-Newton  continuation  on  the  branch  To  which  is  the 
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basic  solution  Uj(x,  >).  This  branch  was  easily  computed  and  sign 

changes  in  det  A^(X  ) were  noted  near  the  first  three  eigenvalues. 

Upon  refining  the  location  of  these  potential  bifurcation  or  limit  points, 

using  false  position,  the  eigenvector  approximations  d>  and  iL  were 

^ h h 

computed.  The  test  ijj  = 0 indicated  bifurcation  at  each  point  and 
Method  II  easily  switched  to  the  branch  T]  , IJ  or  r3  bifurcating  from 
the  corresponding  point  >2  - lt  2 or  3.  These  branches  were  extended 
in  either  direction  by  simply  changing  the  sign  of  the  e used  in  de- 
fining the  normal  vector  in  Method  II.  Then  continuation  generated 
the  branch.  On  two  of  these  extensions,  T]  and  J}  , new  zeros  of 
det  A^(A)  were  found  but  they  failed  the  necessary  test  for  bifurca- 
tion. Indeed  we  see  in  figure  3 that  they  are  simple  limit  points.  The 
branch  Ta  as  shown  in  figure  3 is  actually  covered  twice  but  it  does 
not  show  on  the  figure  due  to  symmetry.  There  is  a fundamental  dif- 
ference between  bifurcations  from  odd  and  even  "eigenvalues"  but 
our  scheme  for  computing  has  no  difficulties  with  either  case. 

We  also  started  our  procedure  at  a remote  point  on  the  bifurcated 
branch  T]  . It  of  course  located  the  basic  solution  To  as  a bifurca- 
tion from  this  branch  and  then  proceeded  to  find  the  remainder  of  the 
branches  in  figure  3.  To  completely  automate  our  procedure  we  would 
have  to  devise  step  control  techniques  to  allow  optimum  steps  in  the 
arclength  parameter,  s.  Also  net  selection,  variable  order  (via 
deferred  corrections  or  Richardson  extrapolation)  and  accuracy  tests 
should  be  included.  However  further  testing  with  Methods  I-IV 
should  be  carried  out  before  general  purpose  codes  are  seriously  con- 
templated. Furthermore  since  the  bulk  of  the  computations  occur  in 
the  continuation  process  the  choice  of  Euler-Newton  must  be  recon- 
sidered. Rheinboldt  [ 181  has  initiated  serious  studies  of  this  question. 
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